Methyltransferases, nucleic acid molecules encoding
methyltransferases, their recombinant expression and uses thereof

The present invention relates to proteins which are capable
of functioning as methyltransferases. More specifically,
the present invention relates to methyltransferases which are
capable of carrying out at least one of the following
reactions: the conversion of glycine to sarcosine
(N-methylglycine), sarcosine to N,N-dimethyl glycine and
N,N-dimethyl glycine to betaine (N,N,N-trimethylglycine) in the
presence of a methyl group donor. Furthermore, the present
invention relates to nucleic acid molecules encoding such
methyltransferase proteins, recombinant organisms which are
capable of expressing said nucleic acids as well as the use
of said recombinant organisms.

Technological background 

Betaine (N,N,N-trimethylglycine) is a quaternary ammonium
compound which can be found in many micro-organisms, animals
and plants. Betaine is synthesized or accumulated in living
cells in response to abiotic stress (salinity, desiccation or
low temperatures) (McCue and Hanson, 1990; Csonka, 1989;
Yancey et al., 1982; Wyn Jones et al., 1977; Gorham, 1995;
Bohnert and Jensen, 1996). Due to its physical properties,
betaine is an osmolyte and thus it is able to restore and
maintain osmotic balance of living cells. In addition, it has
been demonstrated that betaine stabilizes and protects cell
membranes (Coughlan and Heber, 1982) and other macromolecules
(e.g. enzymes) in the cell (Papagiorgiou and Murata, 1995).

Betaine is synthesized by a number of microbes. In addition, 
these microbes are usually capable of accumulating betaine or 
a precursor, choline, from the culture medium (Boch et al., 
1994; Perroud and Rudulier, 1985; Kempf and Bremer, 1995; 
Glaasker et al., 1996; Peter et al., 1996; and Kappes et al., 
1996). Practically all halophilic bacteria are able to use
betaine as at least one of their osmolytes in order to
survive in the high ionic strength environment.

WO 00/11142 PCT/EP99/06037

Also, many plants synthesize betaine in response to drought
or salinity (Rhodes and Hanson, 1993; Gorham, 1995). A
correlation between cold tolerance and betaine synthesis has
been demonstrated (Holmberg, N. 1996; Kishitani et al., 1994;
Nomura et al., 1995). Intracellular concentrations as low as
1 mM have been shown to give protective effects in plants
(Ishitani et al., 1993). Increasing the betaine content of
plants by genetic engineering or plant breeding has been shown
to result in more salt or cold tolerant transgenic plants. A
betaine biosynthesis pathway has been introduced, e.g. in
tobacco (Lilius et al., 1996; JP 8-266179; JP 8-103267),
Arabidopsis (Hayashi et al., 1997) and rice (Nakamura et al.,
1997; Guo, 1997). However, the results obtained are very
preliminary and only phenotypic effects (salt or cold
tolerance) have been demonstrated. Very little is known about
how stress tolerance depends on the intracellular betaine
concentration.

It has recently been shown that plants can acquire better
stress tolerance by accumulating exogenously applied betaine.
Foliar spraying of betaine in a specific phase of growth has
been shown to increase the productivity of many crops
(Makela et al., 1996; Agboma, 1997). Typically, 15 % crop
yield improvements have been obtained with many species
(sorghum, maize, soybean, cotton, potato, tomato) under
conditions of salinity and drought. The detailed
physiological mechanism of betaine action is not fully
understood, but it is known that betaine stimulates
photosynthesis and decreases photorespiration.

In animal cells, betaine also acts as a methyl group donor.
Its most important function is the ability to methylate
homocysteine back to methionine, which can then further be
used as a methyl group donor when metabolized to S-adenosyl
methionine (SAM). It has been demonstrated that orally
administered betaine relieves diarrhea and dehydration in
many animals and inhibits invasion of gut epithelium by
coccidia parasites (Ferket, 1994; Augustine and McNaughton,
1996). As a methyl group donor, betaine has been shown to be
lipotropic, thus decreasing the amount of fat in chicken meat
(Saunderson and MacKinlay, 1990; Barak et al., 1993).

The most extensively studied betaine biosynthesis pathway is
the two-step oxidation reaction of choline to betaine via
betaine aldehyde. This metabolic pathway has been
demonstrated to exist in a number of microbes, plants and
animal cells. The choline-betaine pathway of E. coli (Lamark
et al., 1991) and Pseudomonas aeruginosa (Nagasawa et al.,
1976) comprises an oxygen-dependent choline dehydrogenase,
which catalyzes the oxidation of both choline to betaine
aldehyde and betaine aldehyde to betaine. The E. coli choline
dehydrogenase gene has been cloned and sequenced. In
addition, it has successfully been expressed in many
heterologous organisms (Nomura et al., 1995; Lilius et al.,
1996; Hayashi et al., 1997; Nakamura et al., 1997; Guo,
1997). The enzyme is membrane-bound. The reaction is
independent of soluble cofactors and is linked to electron
transfer. Choline dehydrogenase genes from Sinorhizobium
meliloti (Pocard et al., 1997) and Bacillus (Boch et al.,
1996) have also been isolated.

Alternatively, the oxidation of choline to betaine can be
catalyzed by a choline oxidase found for example in some
Corynebacteria, Brevibacterium and Alcaligenes species
(Nakanishi and Machida, 1981; Kojima et al., 1987). The
enzyme has been shown to also exist in some fungal strains
(Tani et al., 1979). The choline oxidase of Arthrobacter
pascens has been cloned and successfully expressed in E. coli
(Rozwadovski et al., 1991) or Synechococcus (Deshnium et al.,
1995). The reaction uses molecular oxygen as the hydrogen
acceptor and hydrogen peroxide is formed in the reaction.

In plants, the betaine synthesis pathway has been
investigated in detail in sugar beet and spinach (McCue et
al., 1992; Weretilnyk and Hanson, 1989). The first step is
catalyzed by a choline monooxygenase. In plants the enzyme
is located in the chloroplast stroma (Brouquisse et al.,
1989). The gene has recently been cloned from spinach
(Rathinasabapathi et al., 1997).

The second oxidation step from betaine aldehyde to betaine
may also be catalyzed by a betaine aldehyde dehydrogenase. A
betaine aldehyde dehydrogenase has also been found in a
number of organisms (Pseudomonas aeruginosa (30)). Some
plants also have this enzyme (Hanson et al., 1985). In plants, the
enzyme has been demonstrated to have wider substrate
specificity and thus it also catalyzes other reactions
(Trossat et al., 1997). The gene has been cloned from E. coli
(Lamark et al., 1991), spinach (Weretilnyk and Hanson, 1990)
and also from barley (Ishitani et al., 1995). The E. coli
gene has also successfully been expressed in transgenic
tobacco (Holmström et al., 1994).

The ability to synthesize betaine de novo is rare among aerobic
heterotrophic eubacteria. Of all strains examined, only
Actinopolyspora halophila and a related isolate have been
shown to produce betaine from simple carbon sources (Severin
et al., 1992; Galinski, 1993). There are few examples of
other organisms which have been shown to be able to
synthesize betaine from simple carbon sources. The data is
usually based on metabolic studies using NMR. Roberts and
co-workers (1992) have shown that some archaebacterial
methanogens (Methanohalophilus) could synthesize betaine from
glycine via a methylation reaction. However, no enzymes
catalyzing the reactions have successfully been isolated from
these organisms. A similar pathway has been suggested to
exist in Ectothiorhodospira halochloris (Galinski and Truper,
1994).

Attempts to characterize the glycine biosynthesis pathway in
Ectothiorhodospira halochloris have been made
(Tschichholtz-Mikus, 1994). According to the hypothesis
proposed by this author, betaine would be synthesized from
glycine by three methyltransferases, each specific for one
methylation reaction. The isolation of the purified enzymes
was, however, not successful and only one enzyme, specified
as dimethyl glycine methyltransferase, was partially purified.
In addition, the methodology used to study reactions was
rather simple and, as demonstrated herein, the results
obtained by the group differ from those obtained by the
present inventors.

Betaine is used as a feed additive in the feed industry. Thus,
transgenic plants producing high amounts of betaine in vivo
would have better nutritional value. Feed crops (e.g. maize
or soybean) producing sufficient amounts of betaine could
therefore directly be used in feed without the need for
betaine supplementation.

Although betaine is synthesized by many plants, there are 
several commercially important crops such as potato, rice, 
tomato and tobacco which do not accumulate betaine. For 
example, Bulow and co-workers (1995) were the first to 
demonstrate that expression of the E. coli choline 
dehydrogenase in tobacco improves the salt tolerance and 
freezing tolerance (Holmberg, 1996) of transgenic potato and 
tobacco due to endogenously synthesized betaine. The same 
phenomenon has been demonstrated also with Arabidopsis 
(Hayashi et al., 1997) and rice (Nakamura et al., 1997; Guo,
1997). Therefore, expression of the methyltransferases in
plants can facilitate stress tolerance and improve the 
productivity of the plants when grown under conditions of 
water stress or freezing and cold temperatures. 



Moreover, betaine has been shown to induce pathogenesis-related
protein expression in plants (Xin et al., 1996) as well as
to increase the resistance of plants to attack by pathogenic
fungi or nematodes (Blunden et al., 1996; Wu et al., 1997)
and may decrease the incidence of nematode (e.g. Meloidogyne
javanica and M. incognita) attack in plants. Therefore,
transgenic plants producing endogenous betaine can be more
resistant to fungal pathogens. Moreover, the betaine
synthesis in the plants may be coupled to the systemic
resistance genes which are induced when the plant is attacked
by pathogens.

Endogenously synthesized betaine may also affect the
viability of microbes and therefore it would improve their
performance in various biotechnical processes. For instance,
in high cell density fermentation or immobilized cell
systems, the production microbes are subjected to
considerable environmental stress. Betaine has successfully
been used in fermentation media to increase the product yield
in amino acid production. For instance, betaine has been shown
to relieve stress and improve the yield of lysine-producing
Brevibacterium lactofermentum (Kawahara et al., 1990).
Betaine is also commercially sold for this purpose
(Nutristim®, Cultor Corp.). Thus, endogenously synthesized
betaine can improve productivity in biotechnical processes
where the cells are subjected to abiotic stress.

Microbes also suffer from stress when subjected to high
temperatures or when cells are freeze-dried or frozen. The
viability of yeast or bacterial cells may be dramatically
reduced in these processes, used in e.g. frozen dough
manufacturing or preservation of lactic acid bacterium
starters. Therefore, it would be highly advantageous if one
could improve the viability of microbes subjected to
freeze-thaw or freeze-drying processes by accumulating
betaine inside the cells. In addition, it has been shown
that exogenously applied betaine improves the viability of
microbes in extreme pH (Smirnova and Oktyabrsky, 1995;
Chambers and Kunin, 1985).

Improved performance of beneficial, probiotic
microorganisms in the animal digestive tract can be utilized in
animal nutrition. Thus, introduction of a betaine synthesis pathway
can improve the stress tolerance of "probiotic" lactic acid
microbes which efficiently bind to gut epithelial cells,
providing a way to balance the microbial population in the GI
tract and to improve pathogen resistance.

Betaine has been shown to stabilize proteins in the cells.
For example, it has been demonstrated that cytoplasmic
accumulation of betaine will reduce the formation of
inclusion bodies (Blackwell and Horgan, 1991; Bhandari and
Gowrishankar, 1997) which is a problem often encountered when
heterologous proteins are expressed in E. coli. Thus,
co-expression of the genes of betaine biosynthesis with the
protein of interest should result in better solubility of the
heterologous protein and reduce the amount of inclusion
bodies.

The use of the glycine methylation pathway may have a number
of advantages over the oxidative synthesis from choline.
Glycine is synthesized by practically all organisms and as an 
amino acid, this metabolite is present in high concentrations 
in the cells. In contrast, the availability of intracellular 
choline may limit betaine biosynthesis. In addition, the 
metabolism and formation of glycine in cells is known and the 
genes of this metabolic pathway have been cloned, thus 
allowing the engineering of the glycine pathway.

Based on the above, an object of the present invention is to 
provide proteins which are capable of acting in a 
biosynthetic pathway from glycine to betaine as well as 
methods for the purification and production of said proteins.

A further object of the present invention is to provide 
nucleic acid molecules which, when transformed into a host 
organism, encode proteins which are capable of acting in a 
biosynthetic pathway from glycine to betaine.

A further object is to provide recombinant microorganisms
which are capable of expressing one or more proteins which
are capable of acting in a biosynthetic pathway from glycine
to betaine for the production of betaine and precursors
thereof.

Furthermore, an object of the present invention is to provide
recombinant plants which express one or more proteins which
are capable of acting in a biosynthetic pathway from glycine
to betaine for the production of betaine and precursors
thereof.

Furthermore, an object of the present invention is to provide 
a method for the production of betaine and precursors 
thereof, for example sarcosine and dimethyl glycine, in 
recombinant organisms.

A further object is to provide recombinant organisms which 
have an increased concentration of intracellular betaine and 
are useful in the fields of recombinant heterologous protein 
production, agriculture, etc. 

A further object of the present invention is to provide
nucleic acid probes and methods for identifying and cloning
genes which encode proteins which are capable of
participating in a biosynthetic pathway from glycine to
betaine.

A further object of the present invention is to provide 
methods for improving the general growth and/or productivity 
of an organism including enhancing stress tolerance, for
example, salt tolerance, freezing tolerance and cold 
tolerance, enhancing resistance to drought, water stress and 
attack by pathogens in organisms. 

Furthermore, an object of the present invention is to provide
recombinant microorganisms which have improved viability in
culture and enhanced pH tolerance in culture, and which result
in decreased inclusion body formation when expressing a
heterologous protein and in increased solubility,
stability and/or yield of a heterologous protein expressed in
said organism.

Moreover, it is an object of the present invention to provide
an animal feed and animal feed ingredient having enhanced
nutritional value.

Other objects of the present invention will be apparent to 
the skilled person based on the information provided herein. 



Summary of the Invention

The inventors have identified, isolated and purified proteins
which are capable of carrying out at least one of the
following reactions in a metabolic pathway from glycine to
betaine: the conversion of glycine to sarcosine
(N-methylglycine), sarcosine to N,N-dimethyl glycine and
N,N-dimethyl glycine to betaine (N,N,N-trimethylglycine) in the
presence of a methyl group donor. These proteins are
designated herein as methyltransferases based on their
ability to transfer a methyl group from a methyl group donor
to a methyl group acceptor.

The above mentioned reactions are individual steps in a
three-step methylation reaction pathway of glycine to betaine
in certain microorganisms, for example, Ectothiorhodospira
halochloris and Actinopolyspora halophila.

For example, a methyltransferase of the present invention
(designated hereinafter as glycine-sarcosine
methyltransferase or GSMT) has been isolated from
Ectothiorhodospira halochloris and Actinopolyspora halophila
which is capable of catalyzing methylation reactions that
convert glycine to dimethyl glycine, i.e. convert glycine to
sarcosine (N-methyl glycine) and sarcosine to dimethyl
glycine (N,N-dimethyl glycine).

As a further example, a methyltransferase according to the
present invention (designated hereinafter as sarcosine- 
dimethylglycine methyltransferase or SDMT) has been isolated 
from Ectothiorhodospira halochloris and Actinopolyspora 
halophila which is capable of catalyzing methylation 
reactions that convert sarcosine to betaine, i.e. convert 
sarcosine (N-methyl glycine) to dimethyl glycine (N,N- 
dimethyl glycine) and dimethyl glycine to betaine. 

The activities of GSMT and SDMT are described below. 



[Reaction scheme: glycine -> N-methyl glycine (sarcosine) ->
N,N-dimethyl glycine -> N,N,N-trimethyl glycine (betaine).
The first and second methylation steps are labeled
glycine-sarcosine methyltransferase (GSMT); the second and
third steps are labeled sarcosine-dimethylglycine
methyltransferase (SDMT). SAM and SAH are marked alongside
the methylation steps.]

The methyltransferases of the present invention are capable
of utilizing S-adenosyl methionine (hereinafter also referred
to as SAM) as a methyl group donor in the above reactions.
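
The division of the three methylation steps between GSMT and SDMT described above can be illustrated with a short sketch. It is not part of the disclosed methods; the table layout and the helper names `reachable_products` and `sam_consumed` are hypothetical and chosen only for illustration, and the count of one SAM per step assumes that each methylation uses a single SAM molecule as the methyl group donor.

```python
# Steps of the glycine-to-betaine methylation pathway as described in the text.
# Each entry: (substrate, product, enzymes described as catalyzing the step).
PATHWAY = [
    ("glycine", "sarcosine", {"GSMT"}),
    ("sarcosine", "N,N-dimethyl glycine", {"GSMT", "SDMT"}),
    ("N,N-dimethyl glycine", "betaine", {"SDMT"}),
]


def reachable_products(start, enzymes):
    """Return the compounds formed from `start` with the given enzymes.

    Hypothetical helper: walks the linear pathway in order and stops at the
    first step for which none of the available enzymes is listed.
    """
    available = set(enzymes)
    products = []
    current = start
    for substrate, product, catalysts in PATHWAY:
        if substrate != current:
            continue  # this step lies before the starting compound
        if not (catalysts & available):
            break  # no enzyme present for this step
        products.append(product)
        current = product
    return products


def sam_consumed(start, enzymes):
    """Assumption: one SAM molecule is used per methylation step."""
    return len(reachable_products(start, enzymes))


if __name__ == "__main__":
    print(reachable_products("glycine", {"GSMT"}))
    print(reachable_products("glycine", {"GSMT", "SDMT"}))
    print(sam_consumed("glycine", {"GSMT", "SDMT"}))
```

In this sketch GSMT alone carries glycine as far as N,N-dimethyl glycine, while betaine is reached only when SDMT is also present, which mirrors the activities stated for the two enzymes.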

Brief Description of the Figures 

Figure 1. Formation of methylation products from glycine,
sarcosine and dimethyl glycine substrates using the A.
halophila cell extract. The retention times of the standards
are shown by arrows.

Figure 2. Analysis of purified methyltransferases on
SDS-PAGE. A) E. halochloris GSMT; B) A. halophila SDMT. Lane 1,
purified protein sample; lane 2, molecular weight marker.

Figure 3. The determination of the isoelectric point of A. 
halophila SDMT by isoelectric focusing. 

Figure 4. The pH optimum of A. halophila SDMT. (•) Activity
on sarcosine; (A) Activity on dimethyl glycine.

Figure 5. The temperature dependence of A. halophila SDMT 
activity. (•) Activity on sarcosine; (*) Activity on 
dimethyl glycine. 

Figure 6. In vitro synthesis of betaine by using the purified 
E. halochloris GSMT and A. halophila SDMT enzymes. The 
retention times of the standards are shown by arrows. 

Figure 7. The schematic structure of the betaine operons of
A. halophila and E. halochloris. GSMT; glycine sarcosine
methyltransferase. SDMT; sarcosine dimethyl glycine
methyltransferase. SAMS; S-adenosyl methionine synthase.

Figure 8. The nucleotide and amino acid sequence of the E. 
halochloris betaine operon. The arrows indicate the amino 
acids encoding GSMT, SDMT and SAMS. The underlined regions
indicate regions which are hybridized with the primers used
to construct the expression vectors in heterologous
organisms. The * indicates a stop codon.

Figure 9. The nucleotide and amino acid sequence of the A.
halophila betaine operon. The underlined regions indicate 
regions which are hybridized with the primers used to 
construct the expression vectors in heterologous organisms. 
The * indicates a stop codon. 

Figure 10. Schematic presentation of the expression plasmid
used in expression of the methyltransferases. The insert was
ligated to vector digested with NcoI/BglII.

Figure 11. The growth curves of E. coli transformants
carrying the E. halochloris GSMT gene (EGSM). Transformant
carrying only the cloning vector (pQE-60) was used as the
control.

Figure 12. The growth curves of E. coli transformants
carrying the E. halochloris GSMT and SDMT genes (EhFD).
Transformant carrying only the cloning vector (pQE-60) was
used as the control.

Detailed Description of the Invention 

One embodiment of the present invention provides a
methyltransferase, for example GSMT, capable of catalyzing
the conversion of glycine to sarcosine (N-methyl glycine)
and/or the conversion of sarcosine to dimethyl glycine
(N,N-dimethyl glycine). Preferably, said methyltransferase
comprises an amino acid sequence selected from the group
consisting of:

(a) an amino acid sequence as depicted in SEQ ID NO:2 and
an amino acid sequence as depicted in SEQ ID NO:6,

(b) a fragment of an amino acid sequence as defined in (a)
and

(c) a derivative of an amino acid sequence as defined in (a)
and (b).

A fragment of an amino acid sequence as depicted in SEQ ID
NO:2 or SEQ ID NO:6 is designated as any fragment of an amino
acid sequence as depicted in SEQ ID NO:2 or SEQ ID NO:6 which
is capable of catalyzing the conversion of glycine to
sarcosine (N-methyl glycine) and/or the conversion of
sarcosine to dimethyl glycine (N,N-dimethyl glycine).

A derivative of an amino acid sequence as depicted in SEQ ID
NO:2 or SEQ ID NO:6 is designated as any mutation, deletion,
addition, substitution, insertion or inversion of one or more
amino acids in the amino acid sequence as depicted in SEQ
ID NO:2 or SEQ ID NO:6 which is capable of catalyzing the
conversion of glycine to sarcosine (N-methyl glycine) and/or
the conversion of sarcosine to dimethyl glycine (N,N-dimethyl
glycine). Preferably, a derivative of a methyltransferase
which is capable of catalyzing the conversion of glycine to
sarcosine (N-methyl glycine) and/or the conversion of
sarcosine to dimethyl glycine (N,N-dimethyl glycine) has
about 60 % homology, preferably about 70 % homology, more
preferably about 80 % homology, and most preferably about 90
% homology to the corresponding amino acid sequence depicted
in SEQ ID NO:2 or SEQ ID NO:6. Preferably, when the above
amino acid sequences have one or more mutations,
substitutions, additions and/or insertions, the amino acids
constituting these changes are selected from the 20 standard
naturally-occurring amino acids found in proteins, i.e. Ala,
Val, Leu, Ile, Pro, Phe, Trp, Met, Gly, Ser, Thr, Cys, Tyr,
Asn, Gln, Asp, Glu, Lys, Arg and His. Preferably, the
mutation(s) and/or substitution(s) are conservative, for
example, Ala, Val, Leu, Ile, Pro, Phe, Trp or Met residue(s)
are replaced with one of these amino acids; Gly, Ser, Thr,
Cys, Tyr, Asn or Gln residue(s) are replaced with one of these
amino acids; Asp or Glu are replaced with one of these amino
acids; and Lys, Arg or His are replaced with one of these
amino acids.

In a preferred embodiment, a methyltransferase according to
the invention has the amino acid sequence depicted in SEQ ID
NO:2.

In another preferred embodiment, a methyltransferase
according to the invention has the amino acid sequence
depicted in SEQ ID NO:6.

The methyltransferase of the present invention capable of
catalyzing the conversion of glycine to sarcosine (N-methyl
glycine) and/or the conversion of sarcosine to dimethyl
glycine (N,N-dimethyl glycine) can exist in the form of an
active enzyme or as a zymogen. The term 'zymogen' designates
a protein molecule or fragment or derivative thereof (as
defined above) which is synthesized in an inactive form and
is capable of being activated in vitro or in vivo by the
chemical or enzymatic cleavage of one or more peptide bonds.
A preferred zymogen of the methyltransferase of the present
invention capable of catalyzing the conversion of glycine to
sarcosine (N-methyl glycine) and/or the conversion of
sarcosine to dimethyl glycine (N,N-dimethyl glycine)
comprises the amino acid sequence as depicted in SEQ ID NO:2
and SEQ ID NO:3, wherein the N-terminus of SEQ ID NO:3 is
joined to the C-terminus of SEQ ID NO:2.

A further embodiment of the present invention provides a
methyltransferase, for example SDMT, capable of catalyzing
the conversion of sarcosine (N-methyl glycine) to dimethyl
glycine (N,N-dimethyl glycine) and/or dimethyl glycine to
betaine. Preferably, said methyltransferase comprises an
amino acid sequence selected from the group consisting of:

(a) an amino acid sequence as depicted in SEQ ID NO:3 and
an amino acid sequence as depicted in SEQ ID NO:7,

(b) a fragment of an amino acid sequence as defined in (a)
and

(c) a derivative of an amino acid sequence as defined in (a)
and (b).

A fragment of an amino acid sequence as depicted in SEQ ID
NO:3 or SEQ ID NO:7 is designated as any fragment of an amino
acid sequence as depicted in SEQ ID NO:3 or SEQ ID NO:7 which
is capable of catalyzing the conversion of sarcosine
(N-methyl glycine) to dimethyl glycine and/or the conversion of
dimethyl glycine to betaine.

A derivative of an amino acid sequence as depicted in SEQ ID
NO:3 or SEQ ID NO:7 is designated as any mutation, deletion,
addition, substitution, insertion or inversion of one or more
amino acids, or combination thereof, in the amino acid
sequence as depicted in SEQ ID NO:3 or SEQ ID NO:7 which is
capable of catalyzing the conversion of sarcosine (N-methyl
glycine) to dimethyl glycine and/or the conversion of
dimethyl glycine (N,N-dimethyl glycine) to betaine.
Preferably, a derivative of a methyltransferase which is
capable of catalyzing the conversion of sarcosine (N-methyl
glycine) to dimethyl glycine and/or the conversion of
dimethyl glycine (N,N-dimethyl glycine) to betaine has about
60 % homology, preferably about 70 % homology, more
preferably about 80 % homology, and most preferably about 90
% homology to the corresponding amino acid sequence depicted
in SEQ ID NO:3 or SEQ ID NO:7. Preferably, when the above
amino acid sequences have one or more mutations,
substitutions, additions and/or insertions, the amino acids
constituting these changes are selected from the 20 standard
naturally-occurring amino acids found in proteins, i.e. Ala,
Val, Leu, Ile, Pro, Phe, Trp, Met, Gly, Ser, Thr, Cys, Tyr,
Asn, Gln, Asp, Glu, Lys, Arg and His. Preferably, the
mutation(s) and/or substitution(s) are conservative, for
example, Ala, Val, Leu, Ile, Pro, Phe, Trp or Met residue(s)
are replaced with one of these amino acids; Gly, Ser, Thr,
Cys, Tyr, Asn or Gln residue(s) are replaced with one of these
amino acids; Asp or Glu are replaced with one of these amino
acids; and Lys, Arg or His are replaced with one of these
amino acids.

In a preferred embodiment, a methyltransferase according to
the invention has the amino acid sequence depicted in SEQ ID
NO:3.

In another preferred embodiment, a methyltransferase
according to the invention has the amino acid sequence
depicted in SEQ ID NO:7.
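
The homology levels stated above for derivatives (about 60 %, 70 %, 80 % and 90 %) can be illustrated with a minimal sketch. The text does not specify how homology is calculated, so the sketch assumes, purely for illustration, that it is approximated by percent identity over a pre-computed pairwise alignment, and it treats the "about" values as hard cut-offs. The function names, the tier labels and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns holding identical residues.

    Assumption: both inputs are already aligned to equal length, with '-'
    marking a gap. Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Tiers following the wording of the text, checked from highest to lowest.
TIERS = [
    (90.0, "most preferably"),
    (80.0, "more preferably"),
    (70.0, "preferably"),
    (60.0, "derivative"),
]


def homology_tier(percent):
    """Map a percentage onto the tiers; hard cut-offs are an assumption."""
    for threshold, label in TIERS:
        if percent >= threshold:
            return label
    return "below range"


if __name__ == "__main__":
    # Toy peptides, not sequences from the sequence listing.
    value = percent_identity("MKTAYIAKQR", "MKTAYLAKQR")
    print(value, homology_tier(value))
```

A real comparison would first require an alignment step and a stated scoring scheme, neither of which is defined in the text; the sketch only shows how the four tiers nest.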

The methyltransferase of the present invention capable of 
catalyzing the conversion of sarcosine (N-methyl glycine) to 
dimethyl glycine and/or the conversion of dimethyl glycine 
(N,N-dimethyl glycine) to betaine can exist in the form of an 
active enzyme or as a zymogen. The term 'zymogen' designates 
a protein molecule, fragment or derivative thereof (as 
defined above) which is synthesized in an inactive form and 
is capable of being activated in vitro or in vivo by the 
chemical or enzymatic cleavage of one or more peptide bonds.
A preferred zymogen of the methyltransferase of the present
invention capable of catalyzing the conversion of sarcosine
(N-methyl glycine) to dimethyl glycine and/or the conversion
of dimethyl glycine (N,N-dimethyl glycine) to betaine
comprises the amino acid sequence as depicted in SEQ ID NO:2
and SEQ ID NO:3, wherein the N-terminus of SEQ ID NO:3 is
joined to the C-terminus of SEQ ID NO:2.

The methyltransferases of the present invention, for example
naturally occurring GSMT and SDMT or recombinantly produced 
GSMT and SDMT and fragments or derivatives thereof as well as 
zymogens of these naturally occurring or recombinant 
proteins, are preferably isolated to a state free from other 
proteins originating from the organisms from which they are 
isolated; more preferably, to a pure state, most preferably 
to a homogeneous state. 

Another aspect of the present invention provides an enzyme 
capable of catalyzing the synthesis of S-adenosyl methionine 
(SAM), i.e. S-adenosyl methionine synthase (hereinafter 
referred to as SAMS) which converts methionine to S-adenosyl
methionine in the presence of ATP. Preferably, said SAMS
comprises an amino acid sequence selected from the group
consisting of:

(a) an amino acid sequence as depicted in SEQ ID NO:4 and
an amino acid sequence as depicted in SEQ ID NO:8,

(b) a fragment of an amino acid sequence as defined in (a)
and

(c) a derivative of an amino acid sequence as defined in (a)
and (b).

A fragment of an amino acid sequence as depicted in SEQ ID
NO:4 or SEQ ID NO:8 is designated as any fragment of an amino
acid sequence as depicted in SEQ ID NO:4 or SEQ ID NO:8 which
is capable of catalyzing the formation of S-adenosyl
methionine from methionine and ATP (adenosine triphosphate).

A derivative of an amino acid sequence as depicted in SEQ ID
NO: 4 or SEQ ID NO: 8 is designated as any mutation, deletion,
addition, substitution, insertion or inversion of one or more
amino acids, or combination thereof, in the amino acid
sequence as depicted in SEQ ID NO: 4 or SEQ ID NO: 8 which is
capable of catalyzing the synthesis of S-adenosyl methionine
from methionine and ATP. Preferably, a derivative which is
capable of catalyzing the conversion of methionine to
S-adenosyl methionine has about 60 % homology, preferably about
70 % homology, more preferably about 80 % homology, and most
preferably about 90 % homology to the corresponding amino
acid sequence depicted in SEQ ID NO: 4 or SEQ ID NO: 8.
Preferably, when the above amino acid sequences have one or
more mutations, substitutions, additions and/or insertions,
the amino acids constituting these changes are selected from
the 20 standard naturally occurring amino acids found in
proteins, i.e. Ala, Val, Leu, Ile, Pro, Phe, Trp, Met, Gly,
Ser, Thr, Cys, Tyr, Asn, Gln, Asp, Glu, Lys, Arg and His.
Preferably, the mutation(s) and/or substitution(s) are
conservative, for example, Ala, Val, Leu, Ile, Pro, Phe, Trp
or Met residue(s) are replaced with one of these amino acids,
Gly, Ser, Thr, Cys, Tyr, Asn or Gln residue(s) are replaced
with one of these amino acids, Asp or Glu are replaced with
one of these amino acids and Lys, Arg or His are replaced
with one of these amino acids.
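
The four residue groups named in the preceding paragraph can be written down as sets, which gives a simple test for whether a substitution is conservative in the sense used here. The following is only an illustrative sketch: the names CONSERVATIVE_GROUPS and is_conservative are hypothetical and do not appear in the specification.

```python
# Residue groups as listed in the text (three-letter codes).
# The names below are illustrative assumptions, not part of the specification.
CONSERVATIVE_GROUPS = (
    frozenset({"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Trp", "Met"}),
    frozenset({"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"}),
    frozenset({"Asp", "Glu"}),
    frozenset({"Lys", "Arg", "His"}),
)


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues belong to the same group."""
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("Asp", "Glu"))  # both in the Asp/Glu group
    print(is_conservative("Lys", "Glu"))  # residues from different groups
```

Together the four groups cover the 20 standard amino acids listed above, so every possible single substitution is classified.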

In a preferred embodiment, the SAMS of the invention has the
amino acid sequence depicted in SEQ ID NO: 4.

In another preferred embodiment, the SAMS of the invention
has the amino acid sequence depicted in SEQ ID NO: 8.

The SAMS of the present invention, for example naturally 
occurring SAMS or recombinantly produced SAMS and fragments 
or derivatives thereof, are preferably isolated to a state 
free from other proteins originating from the organisms from 
which they are isolated; more preferably, to a pure state, 
most preferably to a homogeneous state. 

The present invention also relates to a nucleic acid molecule
which is capable of encoding a methyltransferase capable of
converting glycine to sarcosine (N-methyl glycine) and/or
sarcosine to dimethyl glycine (N,N-dimethyl glycine).

In a preferred embodiment of the present invention, a nucleic
acid molecule which encodes a methyltransferase capable of
converting glycine to sarcosine (N-methyl glycine) and/or
sarcosine to dimethyl glycine (N,N-dimethyl glycine)
comprises a nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1 and
a DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c).
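
Item (d) relies on the degeneracy of the genetic code: different nucleotide sequences can encode the same amino acid sequence. A minimal sketch of that idea follows, assuming the standard genetic code; the helper names and the two nine-nucleotide toy sequences are hypothetical and are not taken from the sequence listing.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encode_same_protein(seq_a: str, seq_b: str) -> bool:
    """Return True if two DNA sequences are degenerate variants of each other."""
    return translate(seq_a) == translate(seq_b)


if __name__ == "__main__":
    # Toy sequences (hypothetical): both translate to Met-Gly-Ser.
    print(translate("ATGGGTTCT"), translate("ATGGGCAGC"))
    print(encode_same_protein("ATGGGTTCT", "ATGGGCAGC"))
```

The two toy sequences differ at four of nine positions yet encode the same tripeptide, which is the situation item (d) is meant to cover.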

A nucleotide sequence which hybridizes to a nucleic acid
molecule comprising a DNA sequence from nucleotide 208 to
1047 of SEQ ID NO: 1 or a DNA sequence from nucleotide 221 to
1024 of SEQ ID NO: 5 is designated as any nucleotide sequence,
for example DNA or RNA, preferably DNA, which hybridizes
under standard conditions to a DNA sequence from nucleotide
208 to 1047 of SEQ ID NO: 1 or a DNA sequence from nucleotide
221 to 1024 of SEQ ID NO: 5 and encodes a methyltransferase
capable of converting glycine to sarcosine (N-methyl glycine)
and/or sarcosine to dimethyl glycine (N,N-dimethyl glycine).
The term 'standard conditions' designates a standard
procedure used in heterologous hybridization to screen for
genes with enough sequence homology (Maniatis, 1989).
Basically, the hybridization filters are washed first in low
stringency conditions and the stringency is gradually
increased (usually by increasing the washing temperature) in
order to select the positive signals from the background.
For example, plaques or colonies of a gene bank to be
screened can be transferred onto nitrocellulose membranes and
hybridized with a PCR fragment, oligonucleotide or any other
cloned DNA containing a fragment of the above gene of
interest. The probe can be prepared for example by PCR as
described in Example 6. Hybridization can be carried out at
42°C in 50 mM Tris-HCl, pH 7.5, 10 mM EDTA, 1 M NaCl, 0.5%
SDS, 0.1% sodium pyrophosphate, 10x Denhardt's solution
(Maniatis, 1989), 100 µg herring sperm DNA and 125 µg/ml
polyA. The filters can be first washed with low stringency
conditions, for example, at 37°C using 3x SSC, 0.5% SDS and
10% sodium pyrophosphate (Maniatis, 1989). The filters can
then be exposed to x-ray film to monitor the number of
positive clones. The washing temperature will be raised in
2-5°C intervals up to a temperature at which the background
becomes invisible and only a few positive plaques are
obtained. The positive plaques or colonies can then be
purified and the DNA can be isolated, for example for Southern
blot hybridization to check the size of the cloned insert.
The cloned DNA obtained can be sequenced. On the
basis of the sequence homology, it can be concluded that the
DNA contains the gene of interest.
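
The stepwise increase in washing stringency described above can be laid out as a simple temperature schedule. The sketch below is illustrative only: the function name and the upper limit max_c are assumptions, since in practice the end point is determined empirically, when the background disappears, and not by a fixed temperature.

```python
def wash_temperatures(start_c: int = 37, step_c: int = 3, max_c: int = 65) -> list[int]:
    """List successive washing temperatures in degrees Celsius.

    start_c follows the low-stringency wash at 37 °C given in the text;
    step_c must lie in the 2-5 °C range given in the text;
    max_c is a hypothetical cap used only to end the list.
    """
    if not 2 <= step_c <= 5:
        raise ValueError("step must be between 2 and 5 degrees Celsius")
    if max_c < start_c:
        raise ValueError("max_c must not be below start_c")
    return list(range(start_c, max_c + 1, step_c))


if __name__ == "__main__":
    for temperature in wash_temperatures(37, 5, 52):
        print(f"wash at {temperature} °C, then expose to x-ray film")
```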

Preferably, a nucleotide sequence which hybridizes to a
nucleic acid molecule comprising a DNA sequence from
nucleotide 208 to 1047 of SEQ ID NO: 1 or a DNA sequence from
nucleotide 221 to 1024 of SEQ ID NO: 5 has about 60 %,
preferably 70 %, more preferably 80 % and especially 90 %
homology to a nucleotide sequence corresponding to a DNA
sequence from nucleotide 208 to 1047 of SEQ ID NO: 1 or
a DNA sequence from nucleotide 221 to 1024 of SEQ ID NO: 5.
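
The text does not state how the homology percentage is to be calculated. One common reading, assumed here purely for illustration, is percent identity over two sequences that have already been aligned to equal length. The function name and the toy sequences below are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of identical positions between two pre-aligned sequences.

    Assumption: both inputs come from an alignment and have equal length;
    gap characters ('-') count as mismatches unless both are gaps, in which
    case the column is ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no comparable positions")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy example (hypothetical sequences): 8 of 10 positions are identical.
    print(percent_identity("ATGGCTAAGC", "ATGGCAAAGT"))
```

Under this reading, the toy pair would meet the 60 %, 70 % and 80 % thresholds named above but not the 90 % threshold.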

A fragment of a nucleotide sequence from nucleotide 208 to
1047 of SEQ ID NO: 1 or a nucleotide sequence from nucleotide
221 to 1024 of SEQ ID NO: 5 is designated as any nucleic acid
fragment, for example DNA or RNA, preferably DNA, which
encodes a methyltransferase capable of converting glycine to
sarcosine (N-methyl glycine) and/or sarcosine to dimethyl
glycine (N,N-dimethyl glycine).

A preferred nucleic acid molecule according to the invention
comprises the DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO: 1.

A further preferred nucleic acid molecule according to the
invention comprises the DNA sequence from nucleotide 221 to
1024 of SEQ ID NO: 5.

Other preferred nucleic acid molecules according to the
invention comprise the DNA sequence from nucleotide 221 to
1867 of SEQ ID NO: 5 or the DNA sequence from nucleotide 208
to 1902 of SEQ ID NO: 1, the DNA sequence from nucleotide 221
to 1867 of SEQ ID NO: 5 being more preferred.

The present invention also relates to a nucleic acid molecule
which encodes a methyltransferase capable of converting
sarcosine (N-methyl glycine) to dimethyl glycine and/or
dimethyl glycine (N,N-dimethyl glycine) to betaine.

In a preferred embodiment of the present invention, a nucleic
acid molecule which encodes a methyltransferase capable of
converting sarcosine (N-methyl glycine) to dimethyl glycine
and/or dimethyl glycine (N,N-dimethyl glycine) to betaine
comprises a nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 1048 to 1902 of SEQ ID
NO: 1 and
a DNA sequence from nucleotide 1031 to 1867 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c).


A nucleotide sequence which hybridizes to a nucleic acid
molecule comprising a DNA sequence from nucleotide 1048 to
1902 of SEQ ID NO: 1 or a DNA sequence from nucleotide 1031 to
1867 of SEQ ID NO: 5 is designated as any nucleotide sequence,
for example DNA or RNA, preferably DNA, which hybridizes
under standard conditions to a DNA sequence from nucleotide
1048 to 1902 of SEQ ID NO: 1 or a DNA sequence from nucleotide
1031 to 1867 of SEQ ID NO: 5 and encodes a methyltransferase
capable of converting sarcosine (N-methyl glycine) to
dimethyl glycine and/or dimethyl glycine (N,N-dimethyl
glycine) to betaine. The term 'standard conditions'
designates a standard procedure used in heterologous
hybridization to screen for genes with enough sequence
homology (Maniatis, 1989). Basically, the hybridization
filters are washed first in low stringency conditions and the
stringency is gradually increased (usually by increasing the
washing temperature) in order to select the positive signals
from the background. For example, plaques or colonies of a
gene bank to be screened can be transferred onto
nitrocellulose membranes and hybridized with a PCR fragment,
oligonucleotide or any other cloned DNA containing a fragment
of the above gene of interest. The probe can be prepared for
example by PCR as described in Example 6. Hybridization can
be carried out at 42°C in 50 mM Tris-HCl, pH 7.5, 10 mM EDTA,
1 M NaCl, 0.5% SDS, 0.1% sodium pyrophosphate, 10x Denhardt's
solution (Maniatis, 1989), 100 µg herring sperm DNA and 125
µg/ml polyA. The filters can be first washed with low
stringency conditions, for example, at 37°C using 3x SSC,
0.5% SDS and 10% sodium pyrophosphate (Maniatis, 1989). The
filters can then be exposed to x-ray film to monitor the
number of positive clones. The washing temperature will be
raised in 2-5°C intervals up to a temperature at which the
background becomes invisible and only a few positive plaques
are obtained. The positive plaques or colonies can then be
purified and the DNA can be isolated, for example for Southern
blot hybridization to check the size of the cloned insert.
The cloned DNA obtained can be sequenced. On the
basis of the sequence homology, it can be concluded that the
DNA contains the gene of interest.

Preferably, a nucleotide sequence which hybridizes to a
nucleic acid molecule comprising a DNA sequence from
nucleotide 1048 to 1902 of SEQ ID NO: 1 or a DNA sequence from
nucleotide 1031 to 1867 of SEQ ID NO: 5 has about 60 %,
preferably 70 %, more preferably 80 % and especially 90 %
homology to a nucleotide sequence corresponding to a DNA
sequence from nucleotide 1048 to 1902 of SEQ ID NO: 1 or a DNA
sequence from nucleotide 1031 to 1867 of SEQ ID NO: 5.

A fragment of a nucleotide sequence from nucleotide 1048 to
1902 of SEQ ID NO: 1 or a nucleotide sequence from nucleotide
1031 to 1867 of SEQ ID NO: 5 is designated as any nucleic acid
fragment, for example DNA or RNA, preferably DNA, which
encodes a methyltransferase capable of converting sarcosine
(N-methyl glycine) to dimethyl glycine and/or dimethyl
glycine (N,N-dimethyl glycine) to glycine betaine.

A preferred nucleic acid molecule according to the invention
comprises the DNA sequence from nucleotide 1048 to 1902 of
SEQ ID NO: 1.

A further preferred nucleic acid molecule according to the
invention comprises the DNA sequence from nucleotide 1031 to
1867 of SEQ ID NO: 5.

Other preferred nucleic acid molecules according to the
invention comprise the DNA sequence from nucleotide 221 to
1867 of SEQ ID NO: 5 or the DNA sequence from nucleotide 208
to 1902 of SEQ ID NO: 1, the DNA sequence from nucleotide 221
to 1867 of SEQ ID NO: 5 being more preferred.

A further aspect of the present invention provides a nucleic
acid molecule which encodes an enzyme capable of synthesizing
S-adenosyl methionine from methionine and ATP.

In a preferred embodiment of the nucleic acid molecule
provided by the present invention, a nucleic acid molecule
which encodes an enzyme capable of synthesizing S-adenosyl
methionine from methionine and ATP (SAMS) comprises a
nucleotide sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 2027 to 2722 of SEQ ID
NO: 1 and
a DNA sequence from nucleotide 2006 to 3004 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c).

A nucleotide sequence which hybridizes to a nucleic acid
molecule comprising a DNA sequence from nucleotide 2027 to
2722 of SEQ ID NO: 1 or a DNA sequence from nucleotide 2006 to
3004 of SEQ ID NO: 5 is designated as any nucleotide sequence,
for example DNA or RNA, preferably DNA, which hybridizes
under standard conditions to a DNA sequence from nucleotide
2027 to 2722 of SEQ ID NO: 1 or a DNA sequence from nucleotide
2006 to 3004 of SEQ ID NO: 5 and encodes an enzyme capable of
synthesizing S-adenosyl methionine from methionine and ATP. The
term 'standard conditions' designates a standard procedure
used in heterologous hybridization to screen for genes with
enough sequence homology (Maniatis, 1989). Basically, the
hybridization filters are washed first in low stringency
conditions and the stringency is gradually increased (usually
by increasing the washing temperature) in order to select the
positive signals from the background. For example, plaques or
colonies of a gene bank to be screened can be transferred
onto nitrocellulose membranes and hybridized with a PCR
fragment, oligonucleotide or any other cloned DNA containing
a fragment of the above gene of interest. The probe can be
prepared for example by PCR as described in Example 6.
Hybridization can be carried out at 42°C in 50 mM Tris-HCl,
pH 7.5, 10 mM EDTA, 1 M NaCl, 0.5% SDS, 0.1% sodium
pyrophosphate, 10x Denhardt's solution (Maniatis, 1989), 100
µg herring sperm DNA and 125 µg/ml polyA. The filters can be
first washed with low stringency conditions, for example, at
37°C using 3x SSC, 0.5% SDS and 10% sodium pyrophosphate
(Maniatis, 1989). The filters can then be exposed to x-ray
film to monitor the number of positive clones. The washing
temperature will be raised in 2-5°C intervals up to a
temperature at which the background becomes invisible and
only a few positive plaques are obtained. The positive
plaques or colonies can then be purified and the DNA can be
isolated, for example for Southern blot hybridization to check
the size of the cloned insert. The cloned DNA obtained can
be sequenced. On the basis of the sequence homology, it can
be concluded that the DNA contains the gene of interest.

Preferably, a nucleotide sequence which hybridizes to a
nucleic acid molecule comprising a DNA sequence from
nucleotide 2027 to 2722 of SEQ ID NO: 1 or a DNA sequence from
nucleotide 2006 to 3004 of SEQ ID NO: 5 has about 60 %,
preferably 70 %, more preferably 80 % and especially 90 %
homology to a nucleotide sequence corresponding to a DNA
sequence from nucleotide 2027 to 2722 of SEQ ID NO: 1 or a DNA
sequence from nucleotide 2006 to 3004 of SEQ ID NO: 5.

A fragment of a nucleotide sequence from nucleotide 2027 to
2722 of SEQ ID NO: 1 or a DNA sequence from nucleotide 2006 to
3004 of SEQ ID NO: 5 is designated as any nucleic acid
fragment, for example DNA or RNA, preferably DNA, which
encodes an enzyme capable of synthesizing S-adenosyl methionine
from methionine and ATP.

A preferred nucleic acid molecule according to the invention
comprises the DNA sequence from nucleotide 2027 to 2722 of
SEQ ID NO: 1.

A further preferred nucleic acid molecule according to the
invention comprises the DNA sequence from nucleotide 2006 to
3004 of SEQ ID NO: 5.
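
The nucleotide ranges recited so far can be checked arithmetically. The sketch below assumes that the coordinates are 1-based and inclusive, as is usual for sequence listings; the text itself does not say so. The helper names are hypothetical, and the actual sequences of SEQ ID NO: 1 and SEQ ID NO: 5 are not reproduced here.

```python
# (sequence identifier, first nucleotide, last nucleotide) as recited in the text.
REGIONS = [
    ("SEQ ID NO: 1", 208, 1047),
    ("SEQ ID NO: 5", 221, 1024),
    ("SEQ ID NO: 1", 1048, 1902),
    ("SEQ ID NO: 5", 1031, 1867),
    ("SEQ ID NO: 1", 2027, 2722),
    ("SEQ ID NO: 5", 2006, 3004),
]


def region_length(start: int, end: int) -> int:
    """Length of a region given 1-based inclusive coordinates (assumed convention)."""
    if start < 1 or end < start:
        raise ValueError("coordinates must satisfy 1 <= start <= end")
    return end - start + 1


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return the subsequence for 1-based inclusive coordinates."""
    region_length(start, end)  # validates the coordinates
    if end > len(sequence):
        raise ValueError("end lies beyond the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    for name, start, end in REGIONS:
        length = region_length(start, end)
        print(f"{name} {start}-{end}: {length} nt, {length % 3} nt beyond whole codons")
```

Under the assumed convention every recited range has a length divisible by three, which is consistent with each range spanning a whole number of codons.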

Further subject matter of the invention is a DNA probe for
use in identifying and cloning a nucleic acid molecule
encoding a methyltransferase comprising at least 15
nucleotide bases, preferably 20 or more nucleotide bases, of
a nucleotide sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,
a DNA sequence from nucleotide 1048 to 1902 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 1031 to 1867 of SEQ ID
NO: 5.

Said DNA probes can be utilized in a method according to the
invention for identifying and cloning a nucleic acid molecule
encoding a methyltransferase comprising the steps of
hybridizing said probe with a sample containing nucleic acid
of an organism, detecting a nucleic acid molecule in said
sample which hybridizes to said probe and isolating said
detected nucleic acid molecule. Preferred methods include the
use of the polymerase chain reaction (PCR) and Southern
blotting techniques which are described herein and are
familiar to the skilled person in the art.
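
The probe definition above (at least 15, preferably 20 or more, contiguous bases of one of the listed regions) amounts to taking windows from a region. The following sketch lists all such windows of a chosen length; the function name and the 22-nucleotide toy region are hypothetical, and no criteria for choosing among the candidates are implied by the text.

```python
MIN_PROBE_LENGTH = 15  # lower limit stated in the text


def candidate_probes(region: str, length: int = 20) -> list[tuple[int, str]]:
    """Return (1-based start position, probe) for every window of the given length."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError(f"probe length must be at least {MIN_PROBE_LENGTH}")
    region = region.upper()
    return [
        (i + 1, region[i:i + length])
        for i in range(len(region) - length + 1)
    ]


if __name__ == "__main__":
    toy_region = "ATGGCTAAGCTTGACCGTATCG"  # hypothetical 22-nt sequence
    for start, probe in candidate_probes(toy_region, 20):
        print(start, probe)
```

A region shorter than the requested probe length simply yields no candidates.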

Further subject matter of the invention are vectors for
expression of the proteins according to the invention in
prokaryotic and eukaryotic hosts.

In this connection, expression vectors, for example phages,
plasmids and DNA or RNA viruses, are capable of transforming
and/or replicating and expressing the proteins of the present
invention in prokaryotes and/or eukaryotes, for example
bacteria, yeast, fungi and/or plants. Such expression vectors
and methods for their construction are known to the skilled
person and can be provided with nucleic acid elements for
transcription, for example start codons, 'TATA' boxes,
promoters, enhancers, stop codons, etc., and nucleic acid
elements important for translation and processing of the
nucleic acids transcribed from said vectors in a given host,
for example ribosome binding sites, leader sequences for
secretion of the proteins of the present invention, etc.

One embodiment of the invention is an expression vector
comprising a nucleic acid sequence which encodes a
methyltransferase capable of converting glycine to sarcosine
(N-methyl glycine) and/or sarcosine to dimethyl glycine
(N,N-dimethyl glycine) and/or a nucleic acid sequence which
encodes a methyltransferase capable of converting sarcosine
(N-methyl glycine) to dimethyl glycine and/or dimethyl
glycine (N,N-dimethyl glycine) to betaine.

In a preferred embodiment, said expression vector comprises
at least one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,
a DNA sequence from nucleotide 1048 to 1902 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 1031 to 1867 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c).

In a further preferred embodiment, an expression vector is
provided comprising a nucleotide sequence coding for an
enzyme capable of catalyzing the synthesis of S-adenosyl
methionine and at least one nucleotide sequence selected from
the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,
a DNA sequence from nucleotide 1048 to 1902 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 1031 to 1867 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c).

When an expression vector of the present invention contains
one of the above mentioned DNA sequences, preferred
expression vectors comprise a nucleotide sequence selected
from the group consisting of:

(a) the DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1,
the DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,
the DNA sequence from nucleotide 1048 to 1902 of SEQ ID
NO: 1, or
the DNA sequence from nucleotide 1031 to 1867 of SEQ ID
NO: 5.

When an expression vector of the present invention contains
two of the above mentioned DNA sequences, preferred
expression vectors comprise the DNA sequence from nucleotide
208 to 1047 of SEQ ID NO: 1 and the DNA sequence from
nucleotide 1048 to 1902 of SEQ ID NO: 1 or the DNA sequence
from nucleotide 221 to 1024 of SEQ ID NO: 5 and the DNA
sequence from nucleotide 1031 to 1867 of SEQ ID NO: 5, an
expression vector comprising the DNA sequence from nucleotide
221 to 1024 of SEQ ID NO: 5 and the DNA sequence from
nucleotide 1031 to 1867 of SEQ ID NO: 5 being more preferred.

However, expression vectors comprising fragments and/or
derivatives of the above mentioned sequences as well as other
combinations of the above mentioned DNA sequences, for
example a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1 and a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO: 5 or a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO: 5 and a DNA sequence from nucleotide 1048 to 1902 of
SEQ ID NO: 1 or fragments and/or derivatives thereof are also
subject matter of the present invention.

As provided for above, expression vectors of the present
invention can additionally comprise a nucleic acid molecule
coding for an enzyme capable of directly or indirectly
increasing the intracellular amount of S-adenosyl methionine.

The genes encoding the E. halochloris and A. halophila GSMT and
SDMT enzymes are located in a "betaine operon". In E.
halochloris the enzymes are encoded by two separate genes,
whereas in A. halophila the two enzymes are encoded by a single
gene. In addition, the "betaine operon" contains an S-adenosyl
methionine synthase (SAMS) gene. The SAMS enzyme catalyzes
the synthesis of S-adenosyl methionine (SAM) from methionine
and ATP, and thus it is useful in the methylation reactions
of the methyltransferases of the invention because it
increases the concentration of the enzyme substrate SAM.
Therefore co-expression of the SAMS gene with one or more of
the methyltransferase genes of the invention can be used to
increase betaine synthesis in these organisms.

Hence, in a preferred embodiment, the above mentioned
expression vectors can additionally comprise a DNA sequence
from nucleotide 2027 to 2722 of SEQ ID NO: 1 or a DNA sequence
from nucleotide 2006 to 3004 of SEQ ID NO: 5 or fragments
and/or derivatives thereof when the organism to be
transformed is E. halochloris or A. halophila.

Preferred expression vectors of this type, which also encode
the methyltransferases of the present invention, comprise a
DNA sequence from nucleotide 208 to 2722 of SEQ ID NO: 1 or a
DNA sequence from nucleotide 221 to 3004 of SEQ ID NO: 5, the
latter being more preferred.

Alternatively, said nucleic acid molecule coding for an
enzyme capable of directly or indirectly increasing the
intracellular amount of S-adenosyl methionine originates or
is derived from the organism which is to be transformed with
said expression vector. For example, if the organism to be
transformed with a nucleic acid, for example an expression
vector, according to the invention is E. coli, then it is
possible for example to incorporate the gene coding for
S-adenosyl methionine synthase from E. coli (see Markham et
al., 1984). In a similar manner, if the organism to be
transformed with a nucleic acid, for example an expression
vector, according to the invention is Bacillus, then it is
possible for example to incorporate the gene coding for
S-adenosyl methionine synthase from Bacillus subtilis (see
Yocum et al., 1996, and GenBank accession number AF008220).
Other nucleotide sequences which can be used for this purpose
are the SAH hydrolase from Mesembryanthemum crystallinum
(GenBank accession number U79766; Arabidopsis thaliana,
accession number AF059581; S. pombe, accession number
AL022072).

In another aspect of the present invention, the expression
vectors can additionally comprise a nucleic acid molecule
coding for an enzyme capable of increasing the intracellular
amount of glycine. Preferably, said nucleic
acid molecule coding for an enzyme capable of directly or
indirectly increasing the intracellular amount of glycine
originates or is derived from the organism which is to be
transformed with said expression vector.

For example, the expression vector can include a nucleotide
sequence encoding the enzyme phosphoglycerate dehydrogenase
(Bacillus subtilis, accession number L47648; S. pombe,
accession number AL022243; Arabidopsis thaliana, accession
number AB010407), phosphoserine aminotransferase (E. coli,
accession number AE000193, U00096; Bacillus subtilis,
accession number Z99109, AL009126; S. pombe, accession number
Z69944; Arabidopsis thaliana, accession number AL031135),
phosphoserine phosphatase (E. coli, accession number
AE000509; S. cerevisiae, accession number U36473; S. pombe,
accession number D89261) and serine hydroxymethyltransferase
(Bacillus stearothermophilus, E02190; Candida albicans,
accession number AE009966; Zea mays, accession number
W49449).

Further subject matter of the present invention is a
recombinant prokaryotic or eukaryotic organism, for example,
bacteria, yeast, fungus or plant, transformed with at least
one nucleic acid molecule of the invention as defined above,
for example, an expression vector according to the invention
as defined above.

When a recombinant organism according to the invention is a
bacterium, said bacterium is preferably selected from the
group consisting of E. coli, Bacillus, Corynebacteria,
Pseudomonas, lactic acid bacteria and Streptomyces.

Numerous genetic tools for expressing genes in lactic acid
bacteria have been developed in the past few years. The
field has been extensively reviewed by Kuipers et al. (1997).
This opens up many possibilities to develop an inducible
expression system for the GSMT or SDMT genes in lactic acid
bacteria. In addition, the methodology to express
heterologous genes in Bacillus is well established and there
are a number of functional expression systems facilitating the
overexpression of the GSMT and SDMT in Bacillus (Sarvas,
1994).

When the recombinant organism according to the invention is
a yeast, said yeast is preferably selected from the group
consisting of Saccharomyces, Zygosaccharomyces, Pichia,
Kluyveromyces, Candida and Hansenula.

When the recombinant organism according to the invention is
a fungus, said fungus is preferably selected from the group
consisting of Aspergillus, Trichoderma and Penicillium.

When the recombinant organism according to the invention is
a plant, including but not limited to cereals, legumes,
oilseeds, vegetables, fruits, ornamentals and perennials,
said plant is preferably selected from the group consisting
of lettuces, Capsicums, grasses, clovers, alfalfa, beans,
sweet potatoes, cassava, yams, taro, groundnut, brassica,
sugar beet, grapes, potato, tomato, rice, tobacco, rapeseed,
maize, sorghum, cotton, soybean, barley, wheat, rye, canola,
sunflower, linseed, pea, cucumber, carrot, ornamentals,
perennial trees including citrus, pear and almond, and fruits
including strawberry.

In this connection, the term 'plant' according to the 
invention is understood to include individual cells of a 
plant, plant seeds and callus material. 

Further subject matter of the present invention is a method
for the production of a recombinant organism according to the
invention comprising the steps of transforming a host
prokaryotic or eukaryotic organism, preferably a bacterium,
yeast or fungus, with at least one nucleic acid molecule of
the invention as defined above, for example an expression
vector according to the invention as defined above. When the
host organism is to be transformed with a nucleic acid
molecule coding for an enzyme capable of increasing the
intracellular amount of S-adenosyl methionine and/or glycine
which originates from the host organism, this nucleic acid,
for example an expression vector, can be transformed as a
separate molecule or can be cloned into an expression vector
according to the invention. Likewise, when the host organism
is to be transformed with two different nucleic acid
molecules each encoding a different methyltransferase of the
invention, then transformation can be performed using these
two nucleic acid molecules, e.g. expression vectors.

In addition, the present invention relates to a
methyltransferase obtainable by culturing wild-type
Ectothiorhodospira or Actinopolyspora or a recombinant
prokaryotic or eukaryotic organism according to the invention
and isolating said methyltransferase from the organism and/or
the medium used to culture or process said organism as well
as a method for the production of said methyltransferase
comprising the above mentioned steps.

In this connection, a method for the purification of a
methyltransferase capable of catalyzing the conversion of
glycine to dimethyl glycine comprising the steps of
subjecting a sample comprising the methyltransferase to a
matrix containing adenosine, binding said methyltransferase
to said matrix and eluting said methyltransferase from said
matrix is also subject matter of the present invention. In
addition, the above purification step can be combined with
other methods of protein purification including ammonium
sulfate precipitation, size exclusion chromatography, cation
or anion exchange chromatography, hydrophobic interaction
chromatography, etc.

By expressing the genes encoding the GSMT and SDMT enzymes,
it is possible to impart the capability of de novo synthesis
of betaine to different organisms. Suitable host organisms
are practically all bacteria which can be transformed with
foreign DNA (for instance E. coli, Bacillus, Corynebacteria,
Pseudomonas, lactic acid bacteria and Streptomyces), yeast
(for instance Saccharomyces, Zygosaccharomyces, Pichia,
Kluyveromyces, Candida and Hansenula), fungi (for instance
Trichoderma, Aspergillus, Penicillium) or plants.

Therefore, subject matter of the present invention is a
method for the production of betaine comprising the steps of
culturing a recombinant organism according to the invention
and isolating betaine from the organism and/or the medium
used to culture or process said organism.

Further subject matter of the present invention is a method
for the production of sarcosine and/or dimethyl glycine
comprising the steps of culturing an organism transformed
with a nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ ID
NO: 1,
a DNA sequence from nucleotide 221 to 1024 of SEQ ID
NO: 5,

(b) a nucleotide sequence which hybridizes to any of the
sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in (a)
and (b), and

(d) a nucleotide sequence which is degenerate as a result of
the genetic code to any of the nucleotide sequences defined
in (a), (b) and (c),

and isolating sarcosine and/or dimethyl glycine from
said organism or the medium used to culture or process said
organism.

Further subject matter of the invention is a method for
increasing the intracellular concentration of sarcosine,
dimethyl glycine and/or betaine in an organism, enhancing the
general productivity of an organism, enhancing the salt
tolerance of an organism, enhancing the freezing or cold
tolerance of an organism, and/or enhancing the resistance of
an organism to drought and/or low water stress comprising the
steps of transforming an organism with at least one nucleic
acid molecule of the invention as defined above, for example
an expression vector according to the invention as defined
above. Preferably, said organism is a bacterium, yeast, fungus
or plant as recited above.

Further subject matter of the invention is a method for
inducing pathogenesis-related proteins in a plant, increasing 
the resistance of a plant to attack by pathogens and/or 
increasing the nutritional value of a plant comprising the 
steps of transforming a plant with at least one nucleic acid 
molecule of the invention as defined above, for example an 
expression vector according to the invention as defined 
above. Preferably, said plant is a plant as recited above. 

Pathogens include but are not limited to Fusarium sp. which
causes root, shoot and leaf diseases in several plant types,
Rhizoctonia sp. and Pythium sp. which cause soil borne
diseases in crops, Erysiphe sp. which cause mildew in several
species, Phytophthora infestans which causes late blight in
potato and tomato, Alternaria solani which causes early
blight in potato, fungal diseases of soya caused by
Cephalosporium sp., Diaporthe sp., Cercospora sp., Septoria sp.
and Peronospora sp., nematodes for example Meloidogyne
javanica and M. incognita, and insects.

Moreover, subject matter of the invention is a method for 
enhancing the pH tolerance and/or viability of a cultured 
microorganism comprising the steps of transforming a 
microorganism with at least one nucleic acid molecule of the 
invention as defined above, for example an expression vector 
according to the invention as defined above. Preferably, said
microorganism is a bacterium, yeast or fungus as recited
above.

The microorganisms of the invention can also be used as hosts 
in the field of recombinant DNA technology for the expression 
of a heterologous protein of interest. Therefore, subject 
matter of the invention is a method for decreasing inclusion 
body formation, increasing the stability of a heterologous 
protein and/or increasing the production of a heterologous 
protein expressed in a microorganism comprising the steps of 
transforming a microorganism with at least one nucleic acid 
molecule of the invention as defined above, for example an 
expression vector according to the invention as defined 
above, and transforming a microorganism with a nucleic acid 
molecule capable of expressing said heterologous protein. 
Said microorganism can be transformed with the nucleic acid 
molecule, for example an expression vector, according to the 
invention before, during or after the microorganism is 
transformed with a nucleic acid molecule capable of 
expressing said heterologous protein. 


Additional subject matter of the invention is an animal feed 
or animal feed ingredient comprising a recombinant organism 
according to the invention.

The present invention is more closely illustrated by means of
the following examples without limiting the invention to the
examples.

Examples 

Example 1. Demonstration of the methyltransferase pathway in
Actinopolyspora halophila and Ectothiorhodospira halochloris

Preparation of the cell extracts 

The growth medium of Actinopolyspora halophila ATCC 27976
used in all cultivations was the "complex medium" described
by Sehgal and Gibbons (1960). Inoculum was grown at 37°C in a
shake flask with agitation at 180 rpm until the late
exponential growth phase. Then, 8 l of the above medium
with 10 g/l glucose was inoculated with 800 ml culture. The
pH in the fermentor (Biostat M (Braun) laboratory fermentor)
was maintained at pH 6.5-7.5 with 0.5 M H2SO4 and 1 M NaOH.
Agitation and aeration rates were 400 rpm and 10 l/min,
respectively. The cultivation temperature was 37°C. Cells
were grown to late exponential phase and harvested by
centrifugation at 15,000 g for 15 min. Cells were stored at
-75°C. Before disruption, the cells were thawed and suspended
in Buffer I (22 % (w/v) sucrose, 27 mM EDTA, 1 mM
phenylmethylsulfonyl fluoride (PMSF), 50 mM Tris-HCl, pH 7.5)
in a ratio of 1.5 ml Buffer I : 1 g cells (wet weight).

The growth medium used in the cultivation of
Ectothiorhodospira halochloris ATCC 35916 is described by
Tschichholz and Trüper (1990). Cultivation was carried out
anaerobically at 42°C in 1 l glass bottles with continuous
stirring with a magnetic stirrer. The cells were illuminated
during growth (5,000 - 10,000 lux). 100 ml of pre-inoculum
was inoculated into 1 l medium. Cells were grown until late
exponential phase and harvested by centrifugation at 28,000 g
for 20 min. After centrifugation, the cells were suspended in
Buffer II (560 mM Tris-HCl, pH 7.5, 4 mM 2-mercaptoethanol,
50 µM MgCl2, 160 µM EDTA) and disrupted in the presence of
1 mM PMSF and 1 mM dithiothreitol (DTT). Buffer II was added
in a ratio of 1.5 ml Buffer II : 1 g cells (wet weight).

The cells were disrupted with an MSE Soniprep 150 sonicator.
The suspension of A. halophila cells was sonicated in 20 ml
batches (sonication pulses 30 s, cooling intervals 2 min) for
1 min / 2 ml cell suspension. The suspension of E.
halochloris cells was sonicated in 5 ml batches (sonication
pulses 15 s, cooling intervals 2 min) for 1 min / 1.5 ml cell
suspension. The cell debris was removed by centrifugation at
28,000 g at 1°C for 30 min. The cell free extracts were
stored at -75°C.

Methyltransferase activity assay 

Reactions were carried out in 1.5 ml Eppendorf tubes with
caps. The reaction mixture contained 25 µl of 0.1 M substrate
(glycine, sarcosine or dimethylglycine), 25 µl of Buffer II
(see above), 25 µl of 4 mM S-adenosyl-L-methionine containing
45 nCi S-adenosyl-L-[methyl-14C]methionine (Amersham) in 1/10
McIlvaine buffer (pH 3.0), and 25 µl enzyme sample (e.g. cell
free extract). The reaction was initiated by adding the
enzyme. The reaction mixture was incubated for 30 min at
37°C and the reaction was stopped by adding 75 µl of charcoal
suspension (133 g/l in 0.1 M acetic acid). The excess
charcoal selectively adsorbs unreacted S-adenosyl methionine.
The reaction mixtures were then incubated for 10 min at 0°C
and centrifuged for 10 min in a Heraeus table top centrifuge.
75 µl of the supernatant was added to 4.5 ml of aqueous
scintillant (Hionic-Fluor, Packard) and the radioactive
methylation products were measured in a liquid scintillation
counter (Beckman LS 6000 IC). The enzyme sample was diluted
to keep the reaction in the linear range (radioactivity of
the supernatant below 10,000 DPM).
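
As a quick check on the assay recipe above, the final concentrations in the reaction can be worked out from the stated volumes and stock concentrations. The following is only an illustrative sketch: the function and variable names are hypothetical (they do not come from the source), the four 25 µl components are assumed to be additive in volume, and the volume of the charcoal solids is ignored.

```python
# Illustrative sketch; names are hypothetical, numeric values come from the assay text.
ASSAY_COMPONENTS = [
    # (name, volume in µl, stock concentration in mM, or None if not a single solute)
    ("substrate", 25.0, 100.0),               # 0.1 M glycine, sarcosine or dimethylglycine
    ("Buffer II", 25.0, None),
    ("S-adenosyl-L-methionine", 25.0, 4.0),
    ("enzyme sample", 25.0, None),
]

CHARCOAL_UL = 75.0        # charcoal suspension added to stop the reaction
COUNTED_UL = 75.0         # supernatant volume transferred to the scintillant
LINEAR_LIMIT_DPM = 10_000  # stated upper limit of the linear range


def final_concentrations(components):
    """Return (total volume in µl, {name: final mM}), assuming additive volumes."""
    total = sum(volume for _, volume, _ in components)
    finals = {
        name: stock * volume / total
        for name, volume, stock in components
        if stock is not None
    }
    return total, finals


def counted_fraction(reaction_ul, charcoal_ul=CHARCOAL_UL, counted_ul=COUNTED_UL):
    """Fraction of the stopped mixture that is transferred to the scintillation vial."""
    return counted_ul / (reaction_ul + charcoal_ul)


def in_linear_range(dpm, limit=LINEAR_LIMIT_DPM):
    """True if a supernatant count is below the stated linear-range limit."""
    return dpm < limit


total_ul, finals = final_concentrations(ASSAY_COMPONENTS)
print(total_ul, finals, counted_fraction(total_ul))
```

Under these assumptions the substrate is present at 25 mM and S-adenosyl-L-methionine at 1 mM in a 100 µl reaction.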

The cell extracts typically contain the following activities.

Table 1. Methyltransferase activities of A. halophila and E.
halochloris cell extracts on different substrates.

                    Activity* (DPM/30 min incubation)
Organism            Glycine    Sarcosine    Dimethylglycine
A. halophila            900       89,000             84,000
E. halochloris       19,000       76,000            854,000

* The activity is expressed in units based on the
radioactivity of the reaction products after 30 min
incubation.



Characterization of the methylation reaction products in the
reaction mixtures

The reaction products were characterized by HPLC. The
reaction mixture supernatants as described above were
filtered after centrifugation through a Minisart NML 0.2 µm
filter (Sartorius AG) and a 25 - 100 µl sample was analyzed
on an Aminex HPX-87C cation exchange column (300 x 7.8 mm)
(BioRad Laboratories). The HPLC system used was a Varian 500
equipped with a HP 1047 (B) refractive index detector and a
Waters WISP 717 injector. A µBondapak C18 precolumn was used
in the system. 5 mM CaSO4 was used as the eluent and the flow
rate was 0.6 ml/min. A 1 mM mixture of sarcosine,
dimethylglycine and betaine was used as the standard. In order
to detect the radioactive products formed in the enzymatic
reaction, 200 µl fractions were collected during the
chromatographic run. The fractions were analyzed in a liquid
scintillation counter as described above. The dead volume
between the detector and sample outlet was determined with
radioactive betaine and the retention time of the different
fractions was calculated from the sample volume and eluent
flow rate. The radioactivity of the fractions was plotted in
the same figure as the standards, which were used to
identify the reaction products.

The results obtained by using A. halophila cell extracts are
presented in Figure 1. The results indicate that all
methylation products in the three-step reaction are formed
during the incubation.
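
The conversion from collected fractions to retention times described above can be sketched as follows. This is a minimal illustration, not the procedure actually used: the function name is hypothetical, fraction collection is assumed to start at injection, each fraction is assigned the time of its midpoint, and the dead volume (whose value is not given in the text) is a free parameter.

```python
def fraction_retention_times(n_fractions, fraction_volume_ml=0.2,
                             flow_ml_min=0.6, dead_volume_ml=0.0):
    """Midpoint retention time (min) of each collected fraction.

    fraction_volume_ml and flow_ml_min default to the 200 µl fractions and
    0.6 ml/min flow rate stated in the text. dead_volume_ml is the volume
    between detector and sample outlet; its value is an assumption here.
    """
    minutes_per_fraction = fraction_volume_ml / flow_ml_min
    delay_min = dead_volume_ml / flow_ml_min  # outlet lags behind the detector
    return [(i + 0.5) * minutes_per_fraction - delay_min
            for i in range(n_fractions)]


# With the stated settings each fraction spans one third of a minute.
print(fraction_retention_times(3))
```

Plotting the counted radioactivity against these times places the fractions on the same time axis as the refractive index trace of the standards.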






Example 2. Purification of glycine-sarcosine
methyltransferase (GSMT) of E. halochloris

20 mM Tris-HCl buffer (pH 7.5) was used throughout the
purification procedure unless otherwise stated. All the
buffers used contained 1 mM dithiothreitol.

Step 1. Ammonium sulphate fractionation. 25 ml of cell free
extract (as described in example 1) was diluted to 90 ml and
saturated ammonium sulphate in 50 mM Tris-HCl, pH 7.5, was
added to achieve 20 % saturation. The solution was incubated
for 30 min at 0°C and centrifuged at 15,000 g. The
precipitate was discarded and the supernatant purified
further.

Step 2. Hydrophobic interaction chromatography. The
supernatant from step 1 (105 ml) was applied to a Butyl
Sepharose 4 FF (Pharmacia) (10 x 50 mm) column
pre-equilibrated with 20 % (w/v) ammonium sulphate in 20 mM
Tris-HCl, pH 7.5. The column was washed with 45 ml of 20 %
(w/v) ammonium sulphate in 20 mM Tris-HCl and eluted with a
linear gradient of 20 - 0 % ammonium sulphate. The volume of
the gradient was 80 ml and the flow rate was 2 ml/min.
Fractions of 3 ml were collected. The active fractions (40 ml)
were pooled. The ammonium sulphate was removed by gel
filtration (Sephadex G-25, Pharmacia).

Step 3. Ion exchange chromatography. The sample from step 2
(73 ml) was applied to a DEAE-Memsep 1000 HP (Millipore) (1.4
ml) column pre-equilibrated with 20 mM Tris-HCl, pH 7.5. The
column was washed with 15 ml of buffer and eluted with a
linear NaCl gradient (0 - 1 M). The volume of the gradient
was 60 ml and the flow rate was 3 ml/min. 2 ml fractions were
collected. The active fractions (8 ml) were pooled and
concentrated by ultrafiltration (Amicon Centriplus 30;
Ultrafree MC 10,000 NMWL filter unit, Millipore) to 100 µl.

Step 4. Gel filtration. The concentrated sample from step 3
(100 µl) was applied to a Superose 12 HR 30 (Pharmacia)
column. 20 mM Tris-HCl, pH 7.5, containing 150 mM NaCl was
used as the elution buffer with a flow rate of 0.4 ml/min.
0.5 ml fractions were collected. The fractions containing
glycine-sarcosine methyltransferase (GSMT) activity (1.5 ml)
were collected and concentrated by ultrafiltration (Ultrafree
MC 10,000 NMWL filter unit, Millipore) to 100 µl.
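
For the ammonium sulphate fractionation in step 1, the volume of saturated solution needed to reach a given percentage of saturation follows from a simple mixing balance. The sketch below is an illustration under stated assumptions, not part of the described procedure: the function name is hypothetical, the starting extract is taken to contain no ammonium sulphate, and volumes are assumed to be additive.

```python
def saturated_volume_needed(start_ml, target_saturation):
    """Volume (ml) of saturated solution to add to start_ml of salt-free sample.

    Solves v / (start_ml + v) = target_saturation for v, i.e. assumes that
    volumes are additive and that the sample starts at 0 % saturation.
    """
    if not 0.0 <= target_saturation < 1.0:
        raise ValueError("target_saturation must be in [0, 1)")
    return start_ml * target_saturation / (1.0 - target_saturation)


added_ml = saturated_volume_needed(90.0, 0.20)
print(added_ml, 90.0 + added_ml)  # volume to add and resulting total volume
```

For the 90 ml diluted extract and 20 % saturation this balance gives 22.5 ml of saturated solution.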

The purity and the molecular weight were determined by 
gradient SDS-polyacrylamide gel electrophoresis according to 
the following procedure. Electrophoresis under denaturing 
conditions was carried out using pre-made polyacrylamide gel 
slabs (12 % Tris-glycine gel with 4 % stacking gel, Ready 
Gels, Biorad) according to the instructions of the 
manufacturer. Mid range molecular weight standard from 
Promega was used. Staining of the gel was performed with 
0.25% (w/v) Coomassie Blue R-250 (Promega) in 50 % (v/v) 
methanol and 10% (v/v) acetic acid. Stained gels were 
destained with 10% (v/v) methanol and 5% acetic acid
(Laemmli, 1970).

A typical SDS-gel of the purified E. halochloris GSMT is
shown in Figure 2.

Example 3. Purification of sarcosine-dimethylglycine
methyltransferase (SDMT) of Actinopolyspora halophila

An affinity purification method was developed for the
purification of the sarcosine-dimethylglycine
methyltransferase (SDMT). Purification of the protein was not
achieved by standard purification methodology. The affinity
column was prepared as follows. 5'-AMP-Sepharose 4B
(Pharmacia) was treated with alkaline phosphatase to remove
the phosphate group of the ligand. The gel was first swollen
in water (5 ml of distilled water was used per 1 g of dry
5'-AMP-Sepharose 4B). The swollen gel was then washed with 200
ml of distilled water. The gel was equilibrated with
CIP buffer (10 mM MgCl2, 1 mM dithiothreitol, 50 mM NaCl, 10
mM Tris-HCl, pH 7.9). 10 µl (100 U) of CIP (Calf Intestinal
Alkaline Phosphatase (Finnzymes)) was added to the gel. The
gel was incubated for 2 h at 37°C with occasional shaking.
The reaction was stopped by washing the gel with 20 mM
Tris-HCl, pH 7.5.

18 ml of A. halophila cell free extract (as described in
example 1) was applied to a column of adenosine-Sepharose
(10 x 90 mm) pre-equilibrated with 20 mM Tris-HCl, pH 7.5.
The column was washed with 20 mM Tris-HCl, pH 7.5, using a
0.5 ml/min flow rate until the absorbance at 280 nm became
constant. The protein bound to the column was eluted with 1
mM S-adenosyl methionine (20 mM Tris-HCl, pH 7.5). The active
fractions were pooled (25.2 ml) and concentrated by
ultrafiltration (Amicon Centriplus 30). The purity and the
molecular weight were determined by gradient
SDS-polyacrylamide gel electrophoresis as described above.

A typical SDS gel of the purified A. halophila SDMT is shown
in Figure 2.

Example 4. Characterization of the properties of A. halophila 
SDMT protein 

Determination of the molecular weight and sub-unit structure 

Based on the SDS-electrophoresis (Figure 2), the molecular 
weight of A. halophila SDMT is approximately 32 kDa. 



The molecular weight was also determined by gel filtration
with a Superose 12 HR 30 (Pharmacia) column. The flow rate was
0.4 ml/min. The elution buffer was 20 mM Tris-HCl, pH 7.5,
containing 150 mM NaCl.

The molecular weight was calculated from a calibration curve
made with a mixture of standard proteins. The mixture
contained 0.5 mg/ml Blue Dextran (0.5 mg/ml), Ferritin (440
kDa), 7.0 mg/ml aldolase (158 kDa), 2.0 mg/ml ovalbumin (43
kDa) and 1.0 mg/ml chymotrypsinogen (25 kDa). The calculated
molecular weight was 31.6 kDa, which indicates that the
protein is a monomer.
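
A gel filtration calibration of this kind is commonly evaluated by fitting the logarithm of the molecular weight of the standards against their elution volumes. The sketch below illustrates that approach only. The molecular weights are those of the standards listed above, but the elution volumes are made-up placeholder values (the text does not report them), Blue Dextran is treated as a void-volume marker and left out of the fit, and the function names are hypothetical.

```python
import math

# (molecular weight in kDa, elution volume in ml).
# The elution volumes are PLACEHOLDERS for illustration, not measured data.
STANDARDS = [
    (440.0, 9.43),   # ferritin
    (158.0, 11.21),  # aldolase
    (43.0, 13.47),   # ovalbumin
    (25.0, 14.41),   # chymotrypsinogen
]


def fit_calibration(standards):
    """Least-squares line log10(MW) = slope * Ve + intercept."""
    xs = [ve for _, ve in standards]
    ys = [math.log10(mw) for mw, _ in standards]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_mw(elution_volume_ml, slope, intercept):
    """Molecular weight (kDa) read off the fitted calibration line."""
    return 10 ** (slope * elution_volume_ml + intercept)


slope, intercept = fit_calibration(STANDARDS)
# A sample eluting between ovalbumin and chymotrypsinogen (hypothetical volume).
print(estimate_mw(13.9, slope, intercept))
```

A native molecular weight close to the value obtained by SDS electrophoresis is what supports the conclusion that the protein is a monomer.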

The isoelectric point of the enzyme

The isoelectric focusing was performed with the Pharmacia Phast
system using gels with a pH gradient from pH 3 to 9 (IEF 3-9).
A mixture of Pharmacia IEF standard proteins with pIs from
3.5 to 9.3 was used as the standard. The gels were stained by
silver staining as described in example 1. The results shown
in Figure 3 show that the pI of the protein is approximately
4.1-4.2.

Substrate specificity 

The activity of the purified protein was determined as
described in example 1 with glycine, sarcosine and dimethyl
glycine. The data presented in table 2 demonstrate that the
isolated protein catalyzes the stepwise methylation of
sarcosine to dimethyl glycine and of dimethyl glycine to
betaine.

Table 2. The activity of A. halophila SDMT on different
substrates.

Substrate          Activity* (DPM/30 min)
Glycine                   0
Sarcosine            14,500
Dimethylglycine      45,800

* The activity is expressed in arbitrary units based on
the radioactivity of the reaction products after 30 min
incubation.

pH Optimum 

The pH optima of the two methylation reactions were
determined by using the following buffers: 0.1 M piperazine
buffer, pH 5.0; 0.1 M Bis-Tris buffer, pH 6.0; 0.1 M Bis-Tris
buffer, pH 7.0; 0.1 M Tris-HCl, pH 8.0; 0.1 M Tris-HCl, pH
9.0; 0.1 M ethanolamine, pH 10. The exact pH values of the
reaction mixtures were measured. It can be concluded from
Figure 4 that the pH optimum of both enzyme reactions is
at pH 7.5.

Temperature optimum 

The temperature dependence of the enzymatic reactions was
determined with sarcosine and dimethyl glycine. As seen in
Figure 5, the temperature optimum is approximately 45 - 50°C.
When the temperature is elevated above 50°C, the enzyme is
rapidly inactivated.

Example 5. In vitro synthesis of betaine by using purified E.
halochloris GSMT and A. halophila SDMT

The purified GSMT from E. halochloris and SDMT from A.
halophila were concentrated by ultrafiltration (Ultrafree MC
10,000 NMWL filter unit, Millipore) to protein concentrations
of 4.2 mg/ml and 5.6 mg/ml, respectively. The protein
concentration was determined by measuring the absorbance at
280 nm and calculated by the formula: 1 mg protein/ml = 1.0
A280.

The reaction mixture contained 50 µl 5.0 mM glycine, 50 µl 32
mM S-adenosyl methionine in water containing 640 nCi
S-adenosyl-L-[methyl-14C]methionine (Amersham), 50 µl Buffer
II (see example 1), 25 µl GSMT of E. halochloris and 25 µl
SDMT of A. halophila. The reaction was initiated by adding
the enzymes. The reaction mixture was incubated for 2 h at
37°C and the reaction was stopped by adding 150 µl of
charcoal suspension. The reaction mixtures were then
incubated for 10 min at 0°C and centrifuged for 10 min in a
Heraeus table top centrifuge. The supernatants were filtered
through a Minisart NML 0.2 µm filter (Sartorius AG). The
identification of the reaction products was performed by HPLC
as described in example 2.

The chromatogram is presented in Figure 6 and it shows peaks 
corresponding to the retention times of sarcosine, 
dimethylglycine and betaine. 
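
Because each betaine molecule requires three methyl groups, it is worth checking that the methyl donor in the reaction mixture described above is in excess over glycine. The following sketch does that arithmetic from the stated volumes and concentrations. The names are hypothetical, and the enzyme amounts assume that the 25 µl aliquots were taken from the concentrated preparations (4.2 and 5.6 mg/ml), which the text implies but does not state explicitly.

```python
METHYLATIONS_PER_BETAINE = 3  # glycine -> sarcosine -> dimethylglycine -> betaine


def nmol(volume_ul, conc_mM):
    """Amount in nmol: 1 µl of a 1 mM solution contains 1 nmol."""
    return volume_ul * conc_mM


glycine_nmol = nmol(50.0, 5.0)
sam_nmol = nmol(50.0, 32.0)
sam_required_nmol = METHYLATIONS_PER_BETAINE * glycine_nmol
sam_excess_fold = sam_nmol / sam_required_nmol

total_ul = 50.0 + 50.0 + 50.0 + 25.0 + 25.0  # glycine, SAM, Buffer II, GSMT, SDMT
glycine_final_mM = glycine_nmol / total_ul
sam_final_mM = sam_nmol / total_ul

# µl x mg/ml = µg; assumes aliquots of the concentrated enzyme preparations.
gsmt_ug = 25.0 * 4.2
sdmt_ug = 25.0 * 5.6

print(glycine_nmol, sam_nmol, sam_required_nmol, round(sam_excess_fold, 2))
print(total_ul, glycine_final_mM, sam_final_mM, gsmt_ug, sdmt_ug)
```

On these numbers the mixture holds 250 nmol glycine and 1600 nmol S-adenosyl methionine, so the methyl donor is present at roughly twice the amount needed for complete conversion to betaine.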

Example 6. Isolation of the genes of E. halochloris GSMT and 
A. halophila SDMT 

Determination of the N-terminal and internal peptide 
sequences 

The N-terminal and tryptic peptide sequences of the
purified proteins were determined by using a Perkin
Elmer/Applied Biosystems Procise 494A protein sequencing
system as described by Kerovuo et al., 1998. The peptide
sequences obtained are shown in table 3.

Table 3. The peptide sequences obtained from purified E.
halochloris GSMT and A. halophila SDMT. The sequences used to
make the PCR primers are underlined.

Organism/SEQ ID NO:           Sequence

A. halophila
SEQ ID NO: 9 (N-terminus)     EKSYR TEDEFVDM YSNAVHTARDYYNSEDASNFYYHV
SEQ ID NO: 10                 GSVLFTDPMASDDAK
SEQ ID NO: 11                 TGLRNYQAGN
SEQ ID NO: 12                 LXELGPILDRLHLDSG
SEQ ID NO: 13                 ELTRLGI^QNIEFEDLSEYLPVHYGR
SEQ ID NO: 14                 VDISPETRILDLGSGYGA

E. halochloris
SEQ ID NO: 15 (N-terminus)    NTTT / EEQDFGAD PTKVRDTDAYTE
SEQ ID NO: 16                 VRDTDHYTEEYVD /
SEQ ID NO: 17                 DYTRRLMHEVGFQK
SEQ ID NO: 18                 ATY RDADPDFFL HVAEK
SEQ ID NO: 19                 VRDTDHYTEEYVDGFVDKWDDLID



Preparation and screening of the chromosomal gene banks

The genomic DNA from both microbes was isolated essentially
as described in Ausubel et al. (1991). The chromosomal DNAs
were partially digested with SacI and ligated to SacI
digested dephosphorylated Lambda ZAP II arms (Stratagene, La
Jolla, California, USA) and packaged to lambda particles
using Gigapack III Gold packaging extract (Stratagene, La
Jolla, California, USA) according to the protocol provided by
the manufacturer. The chromosomal DNA isolated from the
organisms was used as the template DNA in the PCR reactions.

The probes were made by PCR using the following degenerate
primers. The primers were designed according to Sambrook et
al. (1989).

A. halophila

SEQ ID NO: 20
5'-GA(A/G)GA(C/T)GA(A/G)TT(C/T)GTIGA(C/T)ATGT-3'

SEQ ID NO: 21
5'-(C/T)TG(A/G)TT(T/A/G)AT(T/C)TC(G/A)AA(T/C)TC(A/G)TC-3'

E. halochloris

SEQ ID NO: 22
5'-GA(A/G)CA(A/G)GA(T/C)TT(T/C)GGIGCIGA(T/C)CC-3'

SEQ ID NO: 23
5'-A(A/G)(A/G)AA(A/G)AA(A/G)TCIGG(A/G)TCIGC(A/G)TC-3'
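
The degenerate notation used for the primers above can be parsed mechanically, which makes it easy to see how many distinct oligonucleotides each primer pool contains. The sketch below is an illustration only: the function names are hypothetical, alternatives are read from the parenthesised groups such as (A/G), and inosine (I) is counted as a single, non-degenerate position.

```python
import re
from itertools import product
from math import prod

# One token is either a parenthesised choice such as (A/G) or a single base.
# I (inosine) is treated as one fixed position, which is an assumption of this sketch.
_TOKEN = re.compile(r"\(([ACGT](?:/[ACGT])+)\)|([ACGTI])")

PRIMERS = {
    "SEQ ID NO: 20": "GA(A/G)GA(C/T)GA(A/G)TT(C/T)GTIGA(C/T)ATGT",
    "SEQ ID NO: 21": "(C/T)TG(A/G)TT(T/A/G)AT(T/C)TC(G/A)AA(T/C)TC(A/G)TC",
    "SEQ ID NO: 22": "GA(A/G)CA(A/G)GA(T/C)TT(T/C)GGIGCIGA(T/C)CC",
    "SEQ ID NO: 23": "A(A/G)(A/G)AA(A/G)AA(A/G)TCIGG(A/G)TCIGC(A/G)TC",
}


def parse_primer(notation):
    """Return a list with one tuple of allowed bases per primer position."""
    compact = "".join(notation.split())
    positions = []
    cursor = 0
    for match in _TOKEN.finditer(compact):
        if match.start() != cursor:
            raise ValueError(f"unexpected text at offset {cursor}")
        cursor = match.end()
        if match.group(1):
            positions.append(tuple(match.group(1).split("/")))
        else:
            positions.append((match.group(2),))
    if cursor != len(compact):
        raise ValueError(f"unexpected text at offset {cursor}")
    return positions


def degeneracy(notation):
    """Number of distinct sequences represented by the degenerate primer."""
    return prod(len(choices) for choices in parse_primer(notation))


def expand(notation):
    """All concrete sequences encoded by the degenerate notation."""
    return ["".join(bases) for bases in product(*parse_primer(notation))]


for name, notation in PRIMERS.items():
    print(name, len(parse_primer(notation)), degeneracy(notation))
```

Counted this way, the four primer pools have degeneracies of 32, 192, 32 and 64, respectively.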

The amplification was performed under the following
conditions.

A. halophila - 3 cycles of 1 min at 94°C for denaturation, 1
min at 37°C for annealing and 2 min at 72°C for synthesis,
followed by 32 cycles of 1 min at 94°C for denaturation, 1 min
at 46°C for annealing and 2 min at 72°C for synthesis.
AmpliTaq DNA Polymerase (Perkin Elmer) was used in the
reaction. 1 mM MgCl2 was added to the reaction mixture.
Otherwise standard reagents were used.

E. halochloris - 34 cycles of 1 min at 94°C for denaturation,
1 min at 42°C for annealing, and 2 min at 72°C for synthesis.
AmpliTaq DNA Polymerase (Perkin Elmer) was used in the
reaction. 1 % (v/v) formamide, 1 % (v/v) dimethyl sulfoxide
and 6 mM MgCl2 were added to the reaction mixture.
Gitsch-buffer (reference) was used in the reaction.

The PCR fragments obtained were labeled with the Rediprime DNA
labelling system (Amersham Life Science) according to the
instructions given by the manufacturer.

A total of 50,000 plaques of both libraries were screened and
the positive lambda clones were cored and excised with
ExAssist helper phage (Stratagene, San Diego, USA) to obtain
phagemids. CsCl-gradient purified (Sambrook et al., 1989)
plasmid DNAs were used in DNA sequencing (Zagursky et al.,
1986). The lengths of the cloned fragments were 3.5 kb (A.
halophila) and 5.0 kb (E. halochloris).

Analysis of the sequencing data 

E. halochloris - Sequence analysis of the DNA fragments
indicated that the E. halochloris clone contains 3 ORFs. On
the basis of the peptide sequences it can be concluded that
the E. halochloris GSMT is encoded by the first ORF of the
fragment. In addition, based on the sequence homology with
the A. halophila SDMT, it can be concluded that the second
ORF of the E. halochloris clone contains an SDMT gene. This
has also been demonstrated by expressing the gene in E. coli
(Example 8).

A. halophila - Sequence analysis of the DNA fragments
indicated that the A. halophila clone contains 2 ORFs. The A.
halophila SDMT protein is encoded by the 3'-end of the first
ORF of the A. halophila clone. The 5'-end of the same ORF is
highly homologous to the E. halochloris GSMT. These data
indicate that the GSMT and SDMT are transcribed from a single
gene and the protein isolated from the organism is a
processing product.

In addition to the methyltransferases, the "betaine operon"
codes for a third gene which is homologous to a number of
S-adenosyl-methionine synthases. The operon structure is
schematically shown in Figure 7. The nucleotide and amino
acid sequences of the cloned genes are shown in
Figures 8 and 9.

Example 7. Expression of E. halochloris GSMT in E. coli

Expression of the gene

The gene coding for the E. halochloris GSMT was amplified by
PCR. The purified plasmid used for DNA sequencing in example
6 was used as the template for the PCR reaction. The
following primers were used in the PCR reaction:

primer 1:

5'-CGGACCATGGATACGACTACTGAGCAG-3' (SEQ ID NO: 24)
(5'-end oligonucleotide) and

primer 2:

5'-GCTCAGATCTGTCCTCCTCCCGATATTCCTTCTC-3' (SEQ ID NO: 25)
(3'-end oligonucleotide)

The 3'-ends of the primers are homologous to the 5'- and
3'-end of the GSMT gene. The 5'-end oligonucleotide
hybridizes to position 221-241 and the 3'-end oligonucleotide
to position 1001-1024 (see Figure 8). The primer hybridizing
to the 5'-end contains an extra NcoI restriction site such
that the nucleotide A at position 224 in Figure 8 is replaced
by the nucleotide G in the primer, and the 3'-end primer
contains a BglII site. Both sites were used for cloning.
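
The cloning sites built into the two primers above can be confirmed with a simple string search. The sketch below is illustrative only: the function name is hypothetical, the primer sequences are copied from the text, and the recognition sequences used are the standard ones for NcoI (CCATGG) and BglII (AGATCT).

```python
# Standard recognition sequences of the two enzymes used for cloning.
SITES = {"NcoI": "CCATGG", "BglII": "AGATCT"}

PRIMERS = {
    "primer 1 (SEQ ID NO: 24)": "CGGACCATGGATACGACTACTGAGCAG",
    "primer 2 (SEQ ID NO: 25)": "GCTCAGATCTGTCCTCCTCCCGATATTCCTTCTC",
}


def find_sites(sequence, sites=SITES):
    """Return {enzyme: [0-based start positions]} for every site found."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = [
            i
            for i in range(len(sequence) - len(motif) + 1)
            if sequence[i:i + len(motif)] == motif
        ]
        if positions:
            hits[enzyme] = positions
    return hits


for name, sequence in PRIMERS.items():
    print(name, len(sequence), find_sites(sequence))
```

Each primer carries its site after a four-nucleotide 5' overhang, and in primer 1 the NcoI site overlaps the ATG start codon, which is why the A at position 224 has to be replaced by G.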

The amplification was performed under the following conditions:
34 cycles of 1 min at 94°C for denaturation, 1 min at 50°C
for annealing and 2 min at 72°C for synthesis. Pfu Polymerase
(Stratagene) was used in the reaction. The amplified
fragments were purified with the Qiaquick DNA purification Kit
(Qiagen, Santa Clara, USA) and ligated into NcoI/BglII-cut
pQE-60 expression vectors (Qiagen, Santa Clara, USA). A
schematic presentation of the plasmid (pEGSM) is shown in
Figure 10. Competent XL1-Blue MRF' cells were transformed
with this ligation mix according to Hanahan et al. (1983).
Plasmid minipreps of the transformants were prepared and the
presence of the insert was established by cutting the
plasmids with NcoI and BglII and by separating the
resulting fragments by agarose gel electrophoresis.

GSMT transformants were grown overnight in 2.5 ml of LB broth
containing 100 µg/ml ampicillin. As a control, E. coli
XL1-Blue MRF' transformed with pQE-60 without the insert was
grown.

0.5 ml was inoculated to 1.5 ml of LB broth with ampicillin.
The cultures were grown at 200 rpm for 30 min at 37°C and
isopropyl-β-thiogalactopyranoside (IPTG) was added to a final
concentration of 1 mM to induce the enzyme synthesis. After 3
h 30 min the cells were separated by centrifugation (1,000 g,
3 min).

The cell pellet was suspended in 100 µl Buffer II containing
1 mM PMSF (see example 1) and the cells were disrupted with an
MSE Soniprep 150 sonicator. The cell suspension was sonicated
with sonication pulses of 5 s for 10 s. The samples were
cooled on ice between the pulses. The cell debris was removed
by centrifugation at 13,000 rpm for 30 min at 4°C in a
Heraeus table top centrifuge. The activities of the
supernatants were determined as in example 1.

The activity of the cell extracts of the transformants was
assayed as described in example 1. The activities using
glycine and sarcosine as substrates were typically
3,000-5,000 dpm/30 min and 1,000-2,000 dpm/30 min,
respectively.

Salt tolerance of E. coli clones expressing the E. 
halochloris GSMT 

The strains used in these tests were the positive clone
designated EGSM and E. coli XL1-Blue MRF' transformed with
the cloning vector pQE-60. The growth medium used in this
test was the synthetic medium MM63 described by Larsen et al.
(1987) supplemented with 1.5 ml/l of vitamin solution VA
(Imhoff and Trüper, 1977) and 100 µl/ml ampicillin.

The bacterial strains were grown to mid-exponential growth
phase with shaking at 180 rpm at 37°C and centrifuged (1,000
g, 15 min). The cells were resuspended in the growth medium
to an absorbance of 0.9 at 600 nm.

The automatic turbidimetric system Bioscreen (Labsystems) was
used for monitoring the growth. 30 µl was inoculated to 300
µl of the medium with 0 or 0.2 M NaCl and 1 mM IPTG (final
concentrations). Cells were grown at 37°C with intensive and
continuous shaking. Growth was followed by absorbance
measurements in Bioscreen at 600 nm.

The growth curves of EGSM and E. coli transformed with pQE-60
in a medium without added NaCl and with 0.2 M NaCl are
presented in Figure 11. It can be concluded from the data that
EGSM has an increased tolerance towards osmotic stress.

Example 8. Expression of E. halochloris SDMT in E. coli

The gene encoding the E. halochloris SDMT was amplified by 
PCR. The purified plasmid used for DNA sequencing in example 
6 was used as the template for the PCR reaction. The 
following primers were used in the PCR reaction: 

primer 3:

5'-GCATGCCATGGCGACGOGCTACGACGATCAA-3' (SEQ ID NO: 26)
(5'-end oligonucleotide) and

primer 4:

5'-GGGAAGATCTCCCTTTGCGGAAGTAAAAGATACC-3' (SEQ ID NO: 27)
(3'-end oligonucleotide)

The primers are homologous to the 5'- and 3'-end of the E.
halochloris SDMT gene. The 5'-end oligonucleotide hybridizes
to position 1031-1054 and the 3'-end oligonucleotide to
position 1844-1867 (Figure 8). The primer hybridizing to the
5'-end contains an extra NcoI restriction site and the 3'-end
primer a BglII site, which were used for cloning of the
fragment.

The preparation of the DNA construct and the cultivation of 
the positive E. coli clones were done according to example 7. 
A schematic presentation of the expression plasmid (pESDM) is 
shown in Figure 10. 

The cultivations and preparation of the cell-free extracts
were performed essentially as described in example 7. The
sonication was shortened to three 2 s pulses (total
sonication time 6 s).

The activity of the cell extracts of the transformants was
assayed as described in example 1. The activities using
sarcosine and dimethyl glycine as substrates were typically
20,000 dpm/30 min with both substrates.

Example 9. Co-expression of E. halochloris GSMT and SDMT in
E. coli

The DNA construct made for this experiment contains both GSMT
and SDMT genes separated by a short (3 nucleotides long)
linker. The DNA fragment was obtained by amplification of the
purified plasmid used for DNA sequencing in example 6.

The following primers were used in the amplification:

primer 1:

5'-CGGACCATGGATACGACTACTGAGCAG-3' (SEQ ID NO: 24)
(5'-end oligonucleotide) and

primer 4:

5'-GGGAAGATCTCCCTTTGCGGAAGTAAAAGATACC-3' (SEQ ID NO: 27)
(3'-end oligonucleotide)

The primers are homologous to the 5'-end of the E.
halochloris GSMT gene and the 3'-end of the E. halochloris
SDMT gene. The 5'-end oligonucleotide hybridizes to position
221-241 and the 3'-end oligonucleotide to position 1844-1867
(see Figure 8). The primer hybridizing to the 5'-end contains
an extra NcoI restriction site and the 3'-end primer a BglII
site.

The preparation of the DNA construct and the cultivation of
the positive E. coli clones were done according to example 7.
A schematic presentation of the expression plasmid (pEhFU) is
shown in Figure 10. The induction and preparation of the
cell-free extracts were performed essentially as described in
example 9.

The enzymatic activities were assayed as described in example
1. The cell extracts of the transformants clearly showed
activity with glycine, sarcosine and dimethylglycine. The
activities of the cell extracts with the three substrates
were all over 20,000 dpm/30 min.

Salt tolerance of E. coli clones expressing the E. halochloris
GSMT and SDMT

The test was performed essentially as described in example 7. 
The positive clone designated EhFU was used in this test. 

The growth curves of EhFU and E. coli with pQE-60 in a medium
without added NaCl and with 0.2 M NaCl are presented in
Figure 12. It can be concluded from the data that EhFU has an
increased tolerance to osmotic stress.

Betaine synthesis in E. coli clones expressing the E. 
halochloris GSMT and SDMT 

The growth medium used in this test was the synthetic medium
MM63 described by Larsen et al. (1987) supplemented with 1.5
ml/l of vitamin solution VA (Imhoff and Trüper, 1977) and 100
µl/ml ampicillin. The medium contained 1 % (w/v) glucose.

The clone EhFU (Figure 10) and the control strain (E. coli
XL1-Blue MRF' transformed with the cloning vector pQE-60)
were grown to mid-exponential phase with shaking at 180 rpm
at 37°C. The cells were centrifuged at 1,000 g for 10 min and
resuspended in the growth medium so that the turbidity (A600)
was 0.640. 5 ml of this cell suspension was inoculated to 50
ml of media containing 0.22 or 0.33 M NaCl and 25 mM
L-methionine. The bacterial strains were grown for 2 h with
shaking at 180 rpm at 37°C and 1 mM IPTG was added. Growth
was followed by measuring the turbidity at 600 nm. The cells
were grown to early stationary phase. Cells from 45 ml of
culture were harvested by centrifugation at 1,000 g for 15
min and washed once with the growth medium without glucose.
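
The NaCl concentrations of 0.22 and 0.33 M in the media are consistent with final concentrations of about 0.2 and 0.3 M once the 5 ml inoculum has been added, as the following dilution sketch shows. This is one possible reading rather than a statement from the text: the function name is hypothetical, and the inoculum (cells resuspended in growth medium) is assumed to contribute no added NaCl.

```python
def final_molarity(medium_molarity, medium_ml=50.0, inoculum_ml=5.0,
                   inoculum_molarity=0.0):
    """Concentration after mixing the medium with the inoculum.

    inoculum_molarity defaults to 0, i.e. the cell suspension is assumed
    to carry no added NaCl; volumes are assumed to be additive.
    """
    total_ml = medium_ml + inoculum_ml
    moles = medium_molarity * medium_ml + inoculum_molarity * inoculum_ml
    return moles / total_ml


for nacl in (0.22, 0.33):
    print(nacl, round(final_molarity(nacl), 3))
```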

The cell pellets were suspended in 2 ml of water and kept in 
a boiling water bath for 10 min. The suspension was 
centrifuged for 15 min at 23,000 g and the supernatant 
collected and the pellet resuspended in water. This 
extraction was repeated twice. The three supernatants were 
combined. The volumes of supernatants were measured and the 
supernatants were filtered and analyzed by HPLC as described 
in example 1. 

A similar experiment was also performed without added 
L-methionine. 

The betaine produced inside the cells is presented in Table 
4. 

Table 4. The amount of betaine synthesized inside the cells 
in 1 ml of culture when grown in MM63 

                        Betaine concentration in the cells 
                        (µg/A600 ml) 

Transformant  NaCl      Medium supplemented     No methionine 
              (mol/l)   with 25 mM methionine   added 

EhFU          0         n.d.                    1.0 
              0.2       3.5                     1.1 
              0.3       2.4                     0.9 
pQE-60        0.2       0                       0 
              0.3       0                       0 

The results show that the E. coli clone expressing the E. 
halochloris GSMT and SDMT genes synthesizes betaine in these 
cells. The highest amount of betaine synthesized corresponds 
roughly to 1% (0.2 M NaCl) and 0.5% (0.3 M NaCl) of the cell 
dry weight. 
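
The effect of methionine supplementation can be read off Table 4 by comparing the two columns row by row. The sketch below holds the table values in a small dictionary and computes the fold change for each row; the data layout and function name are illustrative, and "n.d." entries are represented as None.

```python
# Values transcribed from Table 4: (with 25 mM methionine, no methionine added).
# None stands for "n.d." (not determined).
TABLE_4 = {
    ("EhFU", 0.0): (None, 1.0),
    ("EhFU", 0.2): (3.5, 1.1),
    ("EhFU", 0.3): (2.4, 0.9),
    ("pQE-60", 0.2): (0.0, 0.0),
    ("pQE-60", 0.3): (0.0, 0.0),
}


def methionine_fold_change(with_met, without_met):
    """Ratio of betaine with methionine to betaine without it.

    Returns None when the ratio is undefined (missing value or zero baseline).
    """
    if with_met is None or not without_met:
        return None
    return with_met / without_met


for (strain, nacl_m), (with_met, without_met) in TABLE_4.items():
    ratio = methionine_fold_change(with_met, without_met)
    shown = "undefined" if ratio is None else f"{ratio:.1f}-fold"
    print(f"{strain:7s} {nacl_m:.1f} M NaCl: {shown}")
```

For EhFU this gives roughly a threefold higher betaine content with added methionine at both 0.2 and 0.3 M NaCl, while the control rows have no defined ratio because no betaine was detected.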

Example 10. Expression of the DNA fragment encoding the 
protein isolated as A. halophila SDMT in E. coli 

The gene sequencing results revealed that a single gene codes 
for the A. halophila GSMT and SDMT. The fusion protein was 
not, however, successfully purified from the A. halophila 
cell extracts. Instead, a protein with SDMT activity was 
isolated. In this experiment the corresponding part of the 
GSMT-SDMT gene is expressed in E. coli. 

The gene fragment encoding the SDMT enzyme activity was 
amplified by PCR. The genomic DNA from A. halophila isolated 
in example 6 was used as the template for the PCR reaction. 
The following primers were used in the amplification: 

primer 5: 

5'-GCTGCCATGGAGAAGAGCTACCGCACCGAG-3' (SEQ ID NO:28) 
(5'-end oligonucleotide) and 

primer 6: 

5'-GGGAAGATCTTGCCCTGGCGTGGATGATGCCCCA-3' (SEQ ID NO:29) 
(3'-end oligonucleotide) 

The primers are homologous to the 5'- and 3'-end of the ASDMT 
gene. The 5'-end oligonucleotide hybridizes to position 
1048-1068 and the 3'-end to the position 1879-1902 (See 
Figure 9). The primer hybridizing to the 5'-end contains an 
extra NcoI restriction site and the 3'-end primer a BglII site. 
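
The primer design described above can be checked with a few lines of code. The sketch below searches primers 5 and 6 for the NcoI (CCATGG) and BglII (AGATCT) recognition sequences and derives the length of the template region spanned by the stated coordinates. Primer 6 is taken as it is printed in Example 11; the helper names are illustrative, and the tail lengths assume that the 3'-terminal nucleotides of each primer are the ones that anneal at the stated positions.

```python
RESTRICTION_SITES = {"NcoI": "CCATGG", "BglII": "AGATCT"}

PRIMER_5 = "GCTGCCATGGAGAAGAGCTACCGCACCGAG"      # SEQ ID NO:28
PRIMER_6 = "GGGAAGATCTTGCCCTGGCGTGGATGATGCCCCA"  # SEQ ID NO:29


def find_sites(seq):
    """Map each recognition site present in seq to its 0-based start index."""
    return {
        name: seq.find(site)
        for name, site in RESTRICTION_SITES.items()
        if site in seq
    }


def span_length(first, last):
    """Number of nucleotides in the inclusive coordinate range first..last."""
    return last - first + 1


def tail_length(primer, first, last):
    """Non-annealing 5' tail, assuming the 3' end anneals at first..last."""
    return len(primer) - span_length(first, last)


print("primer 5 sites:", find_sites(PRIMER_5))
print("primer 6 sites:", find_sites(PRIMER_6))
print("template span 1048-1902:", span_length(1048, 1902), "nt")
print("primer 5 tail:", tail_length(PRIMER_5, 1048, 1068), "nt")
print("primer 6 tail:", tail_length(PRIMER_6, 1879, 1902), "nt")
```

Each primer carries exactly one of the two sites, both near the 5' end, which is what directional cloning into an NcoI/BglII-cut vector requires.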

The preparation of the DNA construct and the cultivation of 
the positive E. coli clones were done according to example 7. 
A schematic presentation of the expression plasmid (pASDM) is 
shown in Figure 10. The induction and preparation of the 
cell-free extracts were performed essentially as described in 
example 9. 

The enzymatic activities were assayed as described in example 
1. The cell extracts of the transformants clearly showed SDMT 
activity. The activities on sarcosine and dimethyl glycine 
were typically 20,000 dpm/30 min with both substrates. There 
was no activity on glycine. 

Example 11. Expression of A. halophila GSMT-SDMT fusion 
protein in E. coli 

The gene fragment encoding GSMT and SDMT of A. halophila was 
amplified by PCR. Primers used were: 

primer 7: 

5'-CATGCCATGGCCAAGAGCGTGGACGATCTT-3' (SEQ ID NO:30) 
(5'-end oligonucleotide) and 

primer 6: 

5'-GGGAAGATCTTGCCCTGGCGTGGATGATGCCCCA-3' (SEQ ID NO:29) 
(3'-end oligonucleotide) 

The primers are homologous to the 5'- and 3'-end of the 
AGSMT-ASDMT gene. The 5'-end oligonucleotide hybridizes to 
position 208-231 and the 3'-end to the position 1879-1902 
(See Figure 9). The primer hybridizing to the 5'-end contains 
an extra NcoI restriction site such that the nucleotide A at 
position 211 in Figure 9 is replaced by the nucleotide G in 
the primer, and the 3'-end primer contains a BglII site. 
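
The consequence of the A to G substitution at position 211 can be illustrated by translating the annealing part of primer 7 with and without the substitution. The sketch below is hedged in several ways: it assumes that the 3'-terminal 24 nucleotides of the primer anneal at positions 208-231, that the reading frame starts at position 208, and that the one character that is ambiguous in the printed primer is a G. The helper names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

PRIMER_7 = "CATGCCATGGCCAAGAGCGTGGACGATCTT"  # SEQ ID NO:30 as read here
ANNEAL_START, ANNEAL_END = 208, 231         # coordinates given in the text
SUBSTITUTED_POS = 211                       # A in the template, G in the primer
NCOI_SITE = "CCATGG"


def translate(dna):
    """Translate complete codons of dna, starting at its first nucleotide."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def annealing_region(primer, start, end):
    """3'-terminal part of the primer assumed to anneal at start..end."""
    return primer[-(end - start + 1):]


def with_base(region, region_start, position, base):
    """Copy of region with the nucleotide at the given coordinate replaced."""
    idx = position - region_start
    return region[:idx] + base + region[idx + 1:]


primer_region = annealing_region(PRIMER_7, ANNEAL_START, ANNEAL_END)
template_region = with_base(primer_region, ANNEAL_START, SUBSTITUTED_POS, "A")

site_start = PRIMER_7.find(NCOI_SITE)
substituted_index = len(PRIMER_7) - len(primer_region) + (SUBSTITUTED_POS - ANNEAL_START)

print("primer-encoded peptide:  ", translate(primer_region))
print("template-encoded peptide:", translate(template_region))
print("substituted base lies inside the NcoI site:",
      site_start <= substituted_index < site_start + len(NCOI_SITE))
```

Under these assumptions the substituted G is the last nucleotide of the NcoI site, and the second codon of the expressed protein differs from that of the template.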

The purified plasmid used for DNA sequencing in example 6 was 
used as a template for the PCR reaction. The amplification 
was performed under the following conditions: 34 cycles of 
30 s at 94°C for denaturation, 1 min at 50°C for annealing and 
2 min at 72°C for synthesis. Ligation of the amplification 
product into NcoI/BglII-cut pQE-60 and the transformation of 
XL1-Blue MRF' cells were performed as in example 7. A schematic 
presentation of the expression plasmid is shown in Figure 10. 
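
The cycling conditions given above can be written down as a small program description, which also makes the total hold time explicit. This is a sketch only: the class and function names are illustrative, and ramp times, any initial denaturation and any final extension are not included because the text does not specify them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


# One cycle as described in the text.
CYCLE = (
    Step("denaturation", 94, 30),
    Step("annealing", 50, 60),
    Step("synthesis", 72, 120),
)
CYCLES = 34


def total_hold_seconds(cycle, cycles):
    """Sum of programmed hold times over all cycles (ramping excluded)."""
    return cycles * sum(step.seconds for step in cycle)


seconds = total_hold_seconds(CYCLE, CYCLES)
print(f"{CYCLES} cycles x {sum(s.seconds for s in CYCLE)} s = "
      f"{seconds} s ({seconds / 60:.0f} min)")
```

The programmed holds alone therefore take just under two hours.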

The induction and preparation of the cell-free extracts were 
performed essentially as in example 9 except that the 
sonication pulses were shortened to 2 s and the total 
sonication time to 6 s. The cell-free extract was analyzed by 
SDS-polyacrylamide gel electrophoresis as in example 2. The 
pellet from the centrifuged suspension was suspended in 10 mM 
Tris-HCl buffer, pH 8.0, containing 8 M urea and 0.1 M Na3PO4 
to solubilize the proteins of the pellet and centrifuged for 
15 min in a Heraeus table top centrifuge at 13,000 rpm. The 
supernatants were analyzed by SDS-polyacrylamide gel 
electrophoresis as in example 2. 


The enzymatic activities were assayed as described in example 
1. The cell extracts of the transformants clearly showed SDMT 
activity. The activities on sarcosine and dimethyl glycine 
were typically 10,000 dpm/30 min and 20,000 dpm/30 min, 
respectively. There was no activity on glycine. 

The SDS-polyacrylamide gel of the cell-free extract showed no 
major protein band of the correct size. However, the insoluble 
pellet solubilized with 8 M urea showed a major band 
corresponding to the molecular weight of the GSMT-SDMT fusion 
protein. The results indicate that when A. halophila GSMT-SDMT 
is over-expressed in E. coli it forms inclusion bodies. 
However, a fraction of the protein, which corresponds to the 
SDMT, is proteolytically cleaved and remains soluble in the 
cells. 

Example 12. Expression of E. halochloris GSMT and SDMT in 
tobacco and potato 

Tobacco and potato plants can be transformed by an 
Agrobacterium-mediated transformation system. An identical DNA 
construct can be used for both plants. 

The GSMT gene is first transformed into the plant using a 
plasmid containing a kanamycin resistance marker. Positive 
transgenic plants obtained by screening for the enzyme 
activity are then used as host plants for a second 
transformation with the SDMT gene. Another selection marker, 
hygromycin resistance, is used in the second transformation. 
Experiments are performed using stable transformants of the 
Fx generation. 

The genes of E. halochloris GSMT and SDMT are amplified by 
PCR using plasmid pEFU (see example 10) as the template. 
The primers used hybridize to the same regions of the DNA as 
shown in Fig. 8 (GSMT: primer 1 and primer 2; SDMT: primer 3 
and primer 4). The final DNA constructs are made using 
suitable restriction sites to transfer the genes to plant 
transformation vectors. pBin19-based pGPTV vectors (Becker 
et al., 1992) are used, which have a strong 35S promoter and 
the CaMV polyadenylation signal. 

The resulting plasmids are transformed into A. tumefaciens 
strains. Strain EHA105 (Hood et al., 1993) is used 
as a vector to transform tobacco basically as described by 
Rogers et al. (1986). Strain ClC58p-GV3850 (Zambryski et al., 
1983; Van Larebeke et al., 1974) is used as an 
alternative host to transform potato Solanum tuberosum 
(Désirée) according to Dietze et al. (1995). 


The transformants are analyzed by Southern blot analysis to 
check for the presence of the genes. PCR-amplified, 
DIG-labelled (Boehringer) 200 bp gene fragments are used as a 
probe. The enzymatic activities of the cell extracts of 
transgenic plants and the levels of sarcosine, dimethyl 
glycine and betaine are analyzed as described in Example 1. 

Stress tolerance, for example, tolerance to drought, 
salinity, cold or freezing, resistance to pathogens, etc., is 
determined according to methods known in the art, for 
example, methods described in the technological background 
section of the present application. 

Example 13. Expression of E. halochloris GSMT and SDMT in 
rice 

The plasmid constructions described in Example 12 are also 
used to transfer the GSMT and SDMT genes to rice by particle 
bombardment. The GSMT gene is transferred to rice first and 
positive regenerated transformants are used as host plants 
for the SDMT transformation. 

The following procedure is used. Immature Oryza sativa 
embryos of the Japonica variety Taipei 309 are aseptically 
isolated 10-14 days after pollination from greenhouse plants 
and plated scutellum side up on solid MS medium (Murashige and 
Skoog, 1962) containing 3% sucrose, 2 mg/l 
2,4-dichlorophenoxyacetic acid and 50 mg/l cefotaxime (MS1). 
After 4-6 days (28°C, darkness) embryos are transferred to 
solid MS medium containing 10% sucrose, 2 mg/l 
2,4-dichlorophenoxyacetic acid and 50 mg/l cefotaxime (MS2) and 
subjected within 1 hour to microprojectile bombardment with a 
particle inflow gun. The DNA fragment containing the 
methyltransferase gene and the selective marker (5 µg) is 
precipitated on 1-3 µm gold particles (Aldrich) as described 
by Vain et al. (1993). Gold particles (400 µg per 
bombardment) are accelerated to the target with a particle 
inflow gun (Finer et al., 1992) at a pressure of 6 bar. 
Embryos are placed 16 cm below the syringe filter. Twenty-four 
hours post-bombardment, embryos are subjected to 
selection on solid media (containing hygromycin or kanamycin) 
and incubated at 28°C in the dark. 

After one week embryos are transferred to a liquid selection 
medium, R2 medium (Ohira et al., 1973) supplemented with 3% 
sucrose, 1 mg/l thiamine, 1 mg/l 2,4-dichlorophenoxyacetic 
acid, 50 mg/l cefotaxime and 20 mg/l hygromycin B or 
kanamycin. The embryos are incubated with shaking at 28°C in 
the dark and subcultured weekly. Developing calli are 
isolated 3 to 6 weeks later and transferred to a callus 
increasing medium (R2 medium supplemented with 6% sucrose, MS 
vitamins, 100 mg/l inositol, 2 mg/l 2,4-dichlorophenoxyacetic 
acid, 50 mg/l cefotaxime and 20 mg/l hygromycin B or kanamycin). 
The calli are incubated in this medium at 28°C in the dark and 
subcultured weekly. 


Resistant calli are transferred to solid R2 regeneration 
medium supplemented with 2% sucrose, 3% sorbitol, 20 mg/l 
hygromycin B, 1 mg/l zeatin, 0.5 mg/l indole-3-acetic acid, 
MS vitamins and 0.65% agarose. The callus tissue is 
maintained at 28°C with 12 h of light in order to enhance 
shoot formation. The calli are then subcultured every 3 
weeks until shoots reach a length of 2-3 cm. They are 
transferred to half-strength MS rooting medium without 
hormones, supplemented with 1.5% sucrose and 6.3% Gelrite® 
(Sigma). After 2-4 weeks of cultivation, plantlets are 
transferred directly to the greenhouse and planted in soil. 
Plantlets are grown in 7 liter aquaculture pots with 
fertilizer-enriched earth, 3 plants per pot (day: 12 h, 28°C, 
80% humidity; night: 12 h, 21°C, 60% humidity) until they 
flower and set seeds. 

To check the presence of the transgene, the complexity of 
insertion(s) and the number of copies present, Southern blot 
analysis is performed as described previously (Burkhardt et 
al., 1997). A PCR-amplified, DIG-labelled (Boehringer) 200 bp 
fragment of the coding region of the GSMT or SDMT genes is 
used as a probe. 

The enzymatic activities of the cell extracts of transgenic 
plants and the levels of sarcosine, dimethyl glycine and 
betaine are analyzed as described in Example 1. 

Stress tolerance, for example, tolerance to drought, 
salinity, cold or freezing, resistance to pathogens, etc., is 
determined according to methods known in the art, for 
example, methods described in the technological background 
section of the present application. 

Example 14. Expression of the E. halochloris GSMT and SDMT in 
yeast 

pYX242 plasmid (R&D systems, USA) was used for expressing the 
GSMT and SDMT genes in Saccharomyces cerevisiae. The plasmid 
used (pYX242) is an E. coli-Saccharomyces cerevisiae shuttle 
vector containing a bacterial origin of replication and 
ampicillin resistance gene, a yeast (S. cerevisiae) origin of 
replication from 2 µm DNA, and the yeast LEU2 gene for 
selection in yeast. The two genes are expressed under the 
yeast triose phosphate isomerase (TPI) promoter. 

The DNA of the plasmid pEFU described in Example 10 was used 
as the template for the PCR reactions. The primers used hybridize 
to DNA sequences shown in Fig. 8 (primer 1 and primer 4) and 
thus amplify both the GSMT and SDMT genes. The PCR fragment 
was ligated to the promoter of the expression plasmid with 
standard methods. A fragment containing a TPI transcription 
terminator and a fragment containing the TPI promoter were 
ligated between the two genes. Thus, both genes are 
expressed under the TPI promoter. The primers used in the 
amplification of the fragments ligated into the expression 
plasmid contained suitable restriction sites that were used 
in the cloning. S. cerevisiae GRF18 (MATα, leu2-3,112, 
his3-11,15) was used as the host for transformation. The 
transformation was performed using the standard lithium 
chloride procedure (Ito et al. (1983) J. Bacteriol. 153, 
163-170), with the LEU2 marker of pYX242 used for 
selection. 

The transformants were grown in YNB medium supplemented with 
an amino acid mixture without leucine (R&D systems product 
manual). The cultivation was done overnight at 30°C with 
shaking at 180 rpm. 5 ml of culture was 
centrifuged (1,700 g, 10 min) and the cell pellet was 
suspended in 200 µl of the assay buffer (see example 1) 
supplemented with 1 mM PMSF. The cells were broken by 
vortexing with glass beads (10 x 1 min intervals). The cells 
were kept on ice between the pulses. The cell debris was 
centrifuged down (30 min, 4°C, 10,000 g). 

The methylase activities were assayed from the supernatant as 
described in example 1. The relative enzyme activities on 
different substrates were the following: glycine - 17,000 
dpm/30 min; sarcosine - 71,000 dpm/30 min and dimethyl 
glycine - 530,000 dpm/30 min. 
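
To compare the three substrates, the reported counts can be normalized to the glycine value. The short sketch below does this; it treats the dpm figures purely as relative readings from the same extract, since the text gives no protein amounts from which specific activities could be calculated. The names used are illustrative.

```python
# Activities reported for the yeast extract, in dpm per 30 min.
ACTIVITY_DPM = {
    "glycine": 17_000,
    "sarcosine": 71_000,
    "dimethyl glycine": 530_000,
}


def relative_to(reference, activities):
    """Express every activity as a multiple of the reference substrate."""
    base = activities[reference]
    return {substrate: dpm / base for substrate, dpm in activities.items()}


for substrate, ratio in relative_to("glycine", ACTIVITY_DPM).items():
    print(f"{substrate:17s} {ratio:5.1f} x glycine")
```

On this scale the activity on sarcosine is about four times, and on dimethyl glycine about thirty times, the activity on glycine.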

What is claimed: 

1. A nucleic acid molecule comprising a nucleotide 
sequence selected from the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

2. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 208 to 
1047 of SEQ ID NO:1. 

3. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 221 to 
1024 of SEQ ID NO:5. 

4. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 1048 to 
1902 of SEQ ID NO:1. 

5. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 1031 to 
1867 of SEQ ID NO:5. 

6. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 208 to 
1902 of SEQ ID NO:1. 

7. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 221 to 
1867 of SEQ ID NO:5. 

8. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 208 to 
2722 of SEQ ID NO:1. 

9. A nucleic acid molecule according to claim 1 which 
comprises the DNA sequence from nucleotide 221 to 
3004 of SEQ ID NO:5. 

10. A nucleic acid molecule encoding a 
methyltransferase capable of catalyzing at least 
one of the reactions glycine to sarcosine, 
sarcosine to dimethyl glycine and dimethyl glycine 
to betaine. 

11. A nucleic acid molecule comprising a nucleotide 
sequence selected from the group consisting of: 

(a) a DNA sequence from nucleotide 2027 to 2722 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 2006 to 3004 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

12. A nucleic acid molecule according to claim 11 which 
comprises the DNA sequence from nucleotide 2027 to 
2722 of SEQ ID NO:1. 

13. A nucleic acid molecule according to claim 11 which 
comprises the DNA sequence from nucleotide 2006 to 
3004 of SEQ ID NO:5. 

14. A methyltransferase encoded by a nucleic acid 
molecule comprising a nucleotide sequence selected 
from the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

15. A methyltransferase according to claim 14 encoded 
by a nucleic acid molecule having the DNA sequence 
from nucleotide 208 to 1047 of SEQ ID NO:1. 

16. A methyltransferase according to claim 14 encoded 
by a nucleic acid molecule having the DNA sequence 
from nucleotide 221 to 1024 of SEQ ID NO:5. 

17. A methyltransferase according to claim 14 encoded 
by a nucleic acid molecule having the DNA sequence 
from nucleotide 1048 to 1902 of SEQ ID NO:1. 

18. A methyltransferase according to claim 14 encoded 
by a nucleic acid molecule having the DNA sequence 
from nucleotide 1031 to 1867 of SEQ ID NO:5. 

19. A methyltransferase comprising an amino acid 
sequence selected from the group consisting of: 

(a) an amino acid sequence as depicted in SEQ ID NO:2, 
an amino acid sequence as depicted in SEQ ID NO:3, 
an amino acid sequence as depicted in SEQ ID NO:6, 
an amino acid sequence as depicted in SEQ ID NO:7, 

(b) a fragment of an amino acid sequence as defined in 
(a) and 

(c) a derivative of an amino acid sequence as defined 
in (a) and (b). 

20. A methyltransferase according to claim 19 having 
the amino acid sequence depicted in SEQ ID NO:2. 

21. A methyltransferase according to claim 19 having 
the amino acid sequence depicted in SEQ ID NO:3. 

22. A methyltransferase according to claim 19 having 
the amino acid sequence depicted in SEQ ID NO:6. 

23. A methyltransferase according to claim 19 having 
the amino acid sequence depicted in SEQ ID NO:7. 

24. A methyltransferase according to claim 19 having 
the amino acid sequence depicted in SEQ ID NO:2 and 
SEQ ID NO:3, wherein the N-terminus of SEQ ID NO:3 
is covalently joined to the C-terminus of SEQ ID 
NO:2. 

25. A methyltransferase capable of catalyzing at least 
one of the reactions glycine to sarcosine, 
sarcosine to dimethyl glycine and dimethyl glycine 
to betaine. 

26. An S-adenosyl methionine synthase encoded by a 
nucleic acid molecule comprising a nucleotide 
sequence selected from the group consisting of: 

(a) a DNA sequence from nucleotide 2027 to 2722 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 2006 to 3004 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

27. An S-adenosyl methionine synthase encoded by a 
nucleic acid molecule having the DNA sequence from 
nucleotide 2027 to 2722 of SEQ ID NO:1. 

28. An S-adenosyl methionine synthase encoded by a 
nucleic acid molecule having the DNA sequence from 
nucleotide 2006 to 3004 of SEQ ID NO:5. 

29. An S-adenosyl methionine synthase comprising an 
amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence as depicted in SEQ ID NO:4 
or 

an amino acid sequence as depicted in SEQ ID NO:8, 

(b) a fragment of an amino acid sequence as defined in 
(a) and 

(c) a derivative of an amino acid sequence as defined 
in (a) and (b). 

30. An S-adenosyl methionine synthase according to claim 
26 having the amino acid sequence depicted in SEQ 
ID NO:4. 

31. An S-adenosyl methionine synthase according to claim 
26 having the amino acid sequence depicted in SEQ 
ID NO:8. 

32. An expression vector comprising at least one 
nucleotide sequence selected from the group 
consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

33. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 208 to 
1047 of SEQ ID NO:1. 

34. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 221 to 
1024 of SEQ ID NO:5. 

35. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 1048 to 
1902 of SEQ ID NO:1. 

36. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 1031 to 
1867 of SEQ ID NO:5. 

37. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 208 to 
1902 of SEQ ID NO:1. 

38. An expression vector according to claim 32 
comprising the DNA sequence from nucleotide 221 to 
1867 of SEQ ID NO:5. 

39. An expression vector comprising a nucleotide 
sequence coding for an enzyme capable of increasing 
the amount of intracellular S-adenosyl methionine 
and at least one nucleotide sequence selected from 
the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

40. A recombinant organism transformed with at least 
one nucleic acid molecule comprising a nucleotide 
sequence selected from the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

41. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1. 

42. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5. 

43. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 1048 to 1902 of 
SEQ ID NO:1. 

44. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 1031 to 1867 of 
SEQ ID NO:5. 

45. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 208 to 1902 of SEQ 
ID NO:1. 

46. A recombinant organism according to claim 40 which 
is transformed with an expression vector comprising 
the DNA sequence from nucleotide 221 to 1867 of SEQ 
ID NO:5. 

47. A recombinant organism transformed with a nucleic 
acid molecule comprising a nucleotide sequence 
coding for an enzyme capable of increasing the 
amount of intracellular S-adenosyl methionine and 
at least one nucleotide sequence selected from the 
group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

48. A recombinant organism transformed with a nucleic 
acid molecule comprising a nucleotide sequence 
coding for an enzyme capable of increasing the 
amount of intracellular glycine and at least one 
nucleotide sequence selected from the group 
consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 221 to 1024 of SEQ 
ID NO:5, 

a DNA sequence from nucleotide 1048 to 1902 of SEQ 
ID NO:1, 

a DNA sequence from nucleotide 1031 to 1867 of SEQ 
ID NO:5, 

(b) a nucleotide sequence which hybridizes to any of 
the sequences as defined in (a), 

(c) a fragment of a nucleotide sequence as defined in 
(a) and (b), and 

(d) a nucleotide sequence which is degenerate as a 
result of the genetic code to any of the nucleotide 
sequences defined in (a), (b) and (c). 

49. A recombinant organism according to claim 40, 
wherein the organism is a bacterium. 

50. A recombinant organism according to claim 49, 
wherein the bacterium is selected from the group 
consisting of E. coli, Bacillus, Corynebacteria, 
Pseudomonas, lactic acid bacteria and Streptomyces. 

51. A recombinant organism according to claim 40, 
wherein the organism is a yeast. 

52. A recombinant organism according to claim 51, 
wherein the yeast is selected from the group 
consisting of Saccharomyces, Zygosaccharomyces, 
Pichia, Kluyveromyces, Candida and Hansenula. 

53. A recombinant organism according to claim 40, 
wherein the organism is a fungus. 

54. A recombinant organism according to claim 53, 
wherein the fungus is selected from the group 
consisting of Aspergillus, Trichoderma and 
Penicillium. 

55. A recombinant organism according to claim 40, 
wherein the organism is a plant selected from the 
group comprising cereals, legumes, oilseeds, 
vegetables, fruits, ornamentals and perennial 
trees. 

56. A recombinant organism according to claim 55, 
wherein the plant is selected from the group 
consisting of lettuces, Capsicums, grasses, 
clovers, alfalfa, beans, sweet potatoes, cassava, 
yams, taro, groundnut, brassica, sugar beet, 
grapes, potato, tomato, rice, tobacco, rapeseed, 
maize, sorghum, cotton, soybean, barley, wheat, 
rye, canola, sunflower, linseed, pea, cucumber, 
carrot, ornamentals, perennial trees and fruits. 

57. A recombinant organism according to claim 47, 
wherein the organism is a bacterium. 

58. A recombinant organism according to claim 57, 
wherein the bacterium is selected from the group 
consisting of E. coli, Bacillus, Corynebacteria, 
Pseudomonas, lactic acid bacteria and Streptomyces. 

59. A recombinant organism according to claim 47, 
wherein the organism is a yeast. 

60. A recombinant organism according to claim 59, 
wherein the yeast is selected from the group 
consisting of Saccharomyces, Zygosaccharomyces, 
Pichia, Kluyveromyces, Candida and Hansenula. 

61. A recombinant organism according to claim 47, 
wherein the organism is a fungus. 

62. A recombinant organism according to claim 61, 
wherein the fungus is selected from the group 
consisting of Aspergillus, Trichoderma and 
Penicillium. 

63. A recombinant organism according to claim 47, 
wherein the organism is a plant selected from the 
group comprising cereals, legumes, oilseeds, 
vegetables, fruits, ornamentals and perennial 
trees. 

64. A recombinant organism according to claim 63, 
wherein the plant is selected from the group 
consisting of lettuces, Capsicums, grasses, 
clovers, alfalfa, beans, sweet potatoes, cassava, 
yams, taro, groundnut, brassica, sugar beet, 
grapes, potato, tomato, rice, tobacco, rapeseed, 
maize, sorghum, cotton, soybean, barley, wheat, 
rye, canola, sunflower, linseed, pea, cucumber, 
carrot, ornamentals, perennial trees and fruits. 

65. A recombinant organism according to claim 48, 
wherein the organism is a bacterium. 

66. A recombinant organism according to claim 65, 
wherein the bacterium is selected from the group 
consisting of E. coli, Bacillus, Corynebacteria, 
Pseudomonas, lactic acid bacteria and Streptomyces. 

67. A recombinant organism according to claim 48, 
wherein the organism is a yeast. 

68. A recombinant organism according to claim 67, 
wherein the yeast is selected from the group 
consisting of Saccharomyces, Zygosaccharomyces, 
Pichia, Kluyveromyces, Candida and Hansenula. 

69. A recombinant organism according to claim 48, 
wherein the organism is a fungus. 

70. A recombinant organism according to claim 69, 
wherein the fungus is selected from the group 
consisting of Aspergillus, Trichoderma and 
Penicillium. 

71. A recombinant organism according to claim 48, 
wherein the organism is a plant selected from the 
group comprising cereals, legumes, oilseeds, 
vegetables, fruits, ornamentals and perennial 
trees. 

72. A recombinant organism according to claim 71, 
wherein the plant is selected from the group 
consisting of lettuces, Capsicums, grasses, 
clovers, alfalfa, beans, sweet potatoes, cassava, 
yams, taro, groundnut, brassica, sugar beet, 
grapes, potato, tomato, rice, tobacco, rapeseed, 
maize, sorghum, cotton, soybean, barley, wheat, 
rye, canola, sunflower, linseed, pea, cucumber, 
carrot, ornamentals, perennial trees and fruits. 

73. A method for the production of a recombinant 
organism comprising the steps of transforming a 
host organism with at least one nucleic acid 
molecule comprising a nucleotide sequence selected 
from the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

74. A method for the production of a recombinant
organism comprising the steps of transforming a
host organism with a nucleic acid molecule
comprising a nucleotide sequence coding for an
enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

75. A method for the production of a recombinant
organism comprising the steps of transforming a
host organism with a nucleic acid molecule
comprising a nucleotide sequence coding for an
enzyme capable of increasing the amount of
intracellular glycine and at least one nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

76. A methyl transferase obtainable by culturing an
organism transformed with at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating the methyltransferase from said
organism or the medium used to culture said
organism.

77. A method for the production of a methyltransferase
comprising the steps of culturing an organism
transformed with at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating the methyltransferase from said
organism or the medium used to culture said
organism.

78. A method for the production of sarcosine comprising
the steps of culturing an organism transformed with
a nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating dimethyl glycine from said organism
or the medium used to culture or process said
organism.

79. A method for the production of sarcosine comprising
the steps of culturing an organism transformed with
a nucleic acid molecule comprising a nucleotide
sequence coding for an enzyme capable of increasing
the amount of intracellular S-adenosyl methionine
and at least one nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating sarcosine from said organism or the 
medium used to culture or process said organism. 

80. A method for the production of sarcosine comprising
the steps of culturing an organism transformed with
a nucleic acid molecule comprising a nucleotide
sequence coding for an enzyme capable of increasing
the amount of intracellular glycine and at least
one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating sarcosine from said organism or the 
medium used to culture or process said organism. 

81. A method for the production of dimethyl glycine
comprising the steps of culturing an organism
transformed with a nucleic acid molecule comprising
a nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating dimethyl glycine from said organism
or the medium used to culture or process said
organism.

82. A method for the production of dimethyl glycine
comprising the steps of culturing an organism
transformed with a nucleic acid molecule comprising
a nucleotide sequence coding for an enzyme capable
of increasing the amount of intracellular
S-adenosyl methionine and at least one nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating dimethyl glycine from said organism
or the medium used to culture or process said
organism.

83. A method for the production of dimethyl glycine
comprising the steps of culturing an organism
transformed with a nucleic acid molecule comprising
a nucleotide sequence coding for an enzyme capable
of increasing the amount of intracellular glycine
and at least one nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating dimethyl glycine from said organism
or the medium used to culture or process said
organism.

84. A method for the production of betaine, comprising
the steps of culturing an organism transformed with
at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating betaine from said organism or the
medium used to culture or process said organism.

85. A method for the production of betaine, comprising
the steps of culturing an organism transformed with
a nucleic acid molecule comprising a nucleotide
sequence coding for an enzyme capable of increasing
the amount of intracellular S-adenosyl methionine
and at least one nucleotide sequence selected from
the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating betaine from said organism or the 
medium used to culture or process said organism. 

86. A method for the production of betaine, comprising
the steps of culturing an organism transformed with
a nucleic acid molecule comprising a nucleotide
sequence coding for an enzyme capable of increasing
the amount of intracellular glycine and at least
one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c),

and isolating betaine from said organism or the
medium used to culture or process said organism.

87. A method for increasing the intracellular
concentration of sarcosine, dimethyl glycine or
betaine in an organism comprising the steps of
transforming an organism with at least one nucleic
acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

88. A method for increasing the intracellular 
concentration of sarcosine, dimethyl glycine or 
betaine in an organism comprising the steps of 
transforming an organism with a nucleic acid 
molecule comprising a nucleotide sequence coding 
for an enzyme capable of increasing the amount of 
intracellular S-adenosyl methionine and at least 
one nucleotide sequence selected from the group 
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

89. A method for increasing the intracellular 
concentration of sarcosine, dimethyl glycine or 
betaine in an organism comprising the steps of 
transforming an organism with a nucleic acid 
molecule comprising a nucleotide sequence coding 
for an enzyme capable of increasing the amount of 
intracellular glycine and at least one nucleotide 
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

90. A method for enhancing the salt tolerance of an 
organism comprising the steps of transforming an 
organism with at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

91. A method for enhancing the salt tolerance of an
organism comprising the steps of transforming an
organism with a nucleic acid molecule comprising a
nucleotide sequence coding for an enzyme capable of
increasing the amount of intracellular S-adenosyl
methionine and at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

92. A method for enhancing the salt tolerance of an
organism comprising the steps of transforming an
organism with a nucleic acid molecule comprising a
nucleotide sequence coding for an enzyme capable of
increasing the amount of intracellular glycine and
at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

93. A method for enhancing the freezing or cold
tolerance of an organism comprising the steps of
transforming an organism with at least one nucleic
acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

94. A method for enhancing the freezing or cold
tolerance of an organism comprising the steps of
transforming an organism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

95. A method for enhancing the freezing or cold
tolerance of an organism comprising the steps of
transforming an organism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular glycine and at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

96. A method for enhancing the resistance of an
organism to drought or water stress comprising the
steps of transforming an organism with at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

97. A method for enhancing the resistance of an
organism to drought or water stress comprising the
steps of transforming an organism with a nucleic
acid molecule comprising a nucleotide sequence
coding for an enzyme capable of increasing the
amount of intracellular S-adenosyl methionine and
at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

98. A method for enhancing the resistance of an
organism to drought or water stress comprising the
steps of transforming an organism with a nucleic
acid molecule comprising a nucleotide sequence
coding for an enzyme capable of increasing the
amount of intracellular glycine and at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

99. A method for enhancing the productivity or yield of
an organism comprising the steps of transforming an
organism with at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

100. A method for enhancing the productivity or yield of 
an organism comprising the steps of transforming an 
organism with a nucleic acid molecule comprising a 
nucleotide sequence coding for an enzyme capable of 
increasing the amount of intracellular S-adenosyl 
methionine and at least one nucleic acid molecule 
comprising a nucleotide sequence selected from the 
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

101. A method for enhancing the productivity or yield of
an organism comprising the steps of transforming an
organism with a nucleic acid molecule comprising a
nucleotide sequence coding for an enzyme capable of
increasing the amount of intracellular glycine and
at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

102. A method for inducing pathogenesis-related proteins
in a plant comprising the steps of transforming a
plant with at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

103. A method for inducing pathogenesis-related proteins
in a plant comprising the steps of transforming a
plant with a nucleic acid molecule comprising a
nucleotide sequence coding for an enzyme capable of
increasing the amount of intracellular S-adenosyl
methionine and at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

104. A method for inducing pathogenesis-related proteins
in a plant comprising the steps of transforming a
plant with a nucleic acid molecule comprising a
nucleotide sequence coding for an enzyme capable of
increasing the amount of intracellular glycine and
at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

105. A method for increasing the resistance of a plant
to attack by pathogens comprising the steps of
transforming a plant with at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

106. A method for increasing the resistance of a plant 
to attack by pathogens comprising the steps of 
transforming a plant with a nucleic acid molecule 
comprising a nucleotide sequence coding for an 
enzyme capable of increasing the amount of 
intracellular S-adenosyl methionine and at least 
one nucleic acid molecule comprising a nucleotide 
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

107. A method for increasing the resistance of a plant
to attack by pathogens comprising the steps of 
transforming a plant with a nucleic acid molecule 
comprising a nucleotide sequence coding for an 
enzyme capable of increasing the amount of
intracellular glycine and at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

108. A method for improving the nutritional value of a 
plant comprising the steps of transforming a plant 
with at least one nucleic acid molecule comprising 
a nucleotide sequence selected from the group 
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

109. A method for improving the nutritional value of a 
plant comprising the steps of transforming a plant 
with a nucleic acid molecule comprising a 
nucleotide sequence coding for an enzyme capable of 
increasing the amount of intracellular S-adenosyl 
methionine and at least one nucleic acid molecule 
comprising a nucleotide sequence selected from the 
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

110. A method for improving the nutritional value of a 
plant comprising the steps of transforming a plant 
with a nucleic acid molecule comprising a 
nucleotide sequence coding for an enzyme capable of 
increasing the amount of intracellular glycine and

WO 00/11142 PCT/EP99/06037

at least one nucleic acid molecule comprising a
nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

111. A method for enhancing the pH tolerance of a
cultured microorganism comprising the steps of
transforming a microorganism with at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

112. A method for enhancing the pH tolerance of a 
cultured microorganism comprising the steps of
transforming a microorganism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

113. A method for enhancing the pH tolerance of a 
cultured microorganism comprising the steps of 
transforming a microorganism with a nucleic acid 
molecule comprising a nucleotide sequence coding 
for an enzyme capable of increasing the amount of 
intracellular glycine and at least one nucleic acid 
molecule comprising a nucleotide sequence selected 
from the group consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

114. A method for improving the viability of a cultured 
microorganism comprising the steps of transforming 
a microorganism with at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecule(s) is expressed.

115. A method for improving the viability of a cultured 
microorganism comprising the steps of transforming
a microorganism with a nucleic acid molecule
comprising a nucleotide sequence coding for an
enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

116. A method for improving the viability of a cultured
microorganism comprising the steps of transforming
a microorganism with a nucleic acid molecule
comprising a nucleotide sequence coding for an
enzyme capable of increasing the amount of
intracellular glycine and at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

117. A method for decreasing inclusion body formation in
a microorganism expressing a heterologous protein
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecules are expressed.

118. A method for decreasing inclusion body formation in
a microorganism expressing a heterologous protein
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

119. A method for decreasing inclusion body formation in 
a microorganism expressing a heterologous protein 
comprising the steps of transforming a 
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular glycine and at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

120. A method for increasing the stability of a
heterologous protein expressed in a microorganism
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecules are expressed.

121. A method for increasing the stability of a
heterologous protein expressed in a microorganism
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular S-adenosyl methionine and at least
one nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

122. A method for increasing the stability of a 
heterologous protein expressed in a microorganism 
comprising the steps of transforming a 
microorganism with a nucleic acid molecule capable 
of expressing a heterologous protein and 
transforming said microorganism with a nucleic acid 
molecule comprising a nucleotide sequence coding 
for an enzyme capable of increasing the amount of 
intracellular glycine and at least one nucleic acid
molecule comprising a nucleotide sequence selected
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

123. A method for increasing the production of a
heterologous protein expressed in a microorganism
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with at least one
nucleic acid molecule comprising a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c), such that the
nucleic acid molecules are expressed.

124. A method for increasing the production of a 
heterologous protein expressed in a microorganism 
comprising the steps of transforming a 
microorganism with a nucleic acid molecule capable 
of expressing a heterologous protein and 
transforming said microorganism with a nucleic acid 
molecule comprising a nucleotide sequence coding 
for an enzyme capable of increasing the amount of 
intracellular S-adenosyl methionine and at least 
one nucleotide sequence selected from the group 
consisting of: 

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

125. A method for increasing the production of a
heterologous protein expressed in a microorganism
comprising the steps of transforming a
microorganism with a nucleic acid molecule capable
of expressing a heterologous protein and
transforming said microorganism with a nucleic acid
molecule comprising a nucleotide sequence coding
for an enzyme capable of increasing the amount of
intracellular glycine and at least one nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

126. An animal feed comprising a recombinant organism
transformed with at least one nucleic acid molecule
comprising a nucleotide sequence selected from the
group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

127. An animal feed comprising a recombinant organism
transformed with a nucleic acid molecule comprising
a nucleotide sequence coding for an enzyme capable
of increasing the amount of intracellular
S-adenosyl methionine and at least one nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

128. An animal feed comprising a recombinant organism
transformed with a nucleic acid molecule comprising
a nucleotide sequence coding for an enzyme capable
of increasing the amount of intracellular glycine
and at least one nucleotide sequence selected from
the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

129. An animal feed ingredient comprising a recombinant 
organism transformed with at least one nucleic acid 
molecule comprising a nucleotide sequence selected 
from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

130. An animal feed ingredient comprising a recombinant 
organism transformed with a nucleic acid molecule 
comprising a nucleotide sequence coding for an 
enzyme capable of increasing the amount of 
intracellular S-adenosyl methionine and at least
one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).

131. An animal feed ingredient comprising a recombinant 
organism transformed with a nucleic acid molecule 
comprising a nucleotide sequence coding for an 
enzyme capable of increasing the amount of 
intracellular S-adenosyl methionine and at least
one nucleotide sequence selected from the group
consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

(b) a nucleotide sequence which hybridizes to any of
the sequences as defined in (a),

(c) a fragment of a nucleotide sequence as defined in
(a) and (b), and

(d) a nucleotide sequence which is degenerate as a
result of the genetic code to any of the nucleotide
sequences defined in (a), (b) and (c).
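
Clause (d) of the preceding claims covers sequences that differ from those of (a) to (c) only through the degeneracy of the genetic code. As a minimal illustrative sketch, which is not part of the claimed subject matter, the following Python code translates two sequences with the standard genetic code and reports whether they encode the same amino acids. The function names are assumptions introduced here, and the second sequence is a hypothetical synonymous variant of the first five GSMT codons printed in Figure 8.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# One-letter amino acid codes; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (spaces allowed) into one-letter amino acids."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_degenerate_variant(seq_a: str, seq_b: str) -> bool:
    """Return True if both sequences encode the same amino acid sequence."""
    return translate(seq_a) == translate(seq_b)


# First five codons of the GSMT reading frame as printed in Figure 8.
figure_8_start = "ATG AAT ACG ACT ACT"
# Hypothetical variant using synonymous codons (illustration only).
hypothetical_variant = "ATG AAC ACC ACA ACG"

print(translate(figure_8_start))
print(is_degenerate_variant(figure_8_start, hypothetical_variant))
```

The comparison is made at the protein level only; it does not address hybridization under clause (b) or fragment length under clause (c).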



132. A DNA probe for use in identifying and cloning a
nucleic acid molecule encoding a methyltransferase
comprising at least 15 nucleotides of a nucleotide
sequence selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5.

133. A method for identifying and cloning a nucleic acid
molecule encoding a methyltransferase comprising
the steps of hybridizing a probe consisting of at
least 15 nucleotides of a nucleotide sequence
selected from the group consisting of:

(a) a DNA sequence from nucleotide 208 to 1047 of SEQ
ID NO:1,

a DNA sequence from nucleotide 221 to 1024 of SEQ
ID NO:5,

a DNA sequence from nucleotide 1048 to 1902 of SEQ
ID NO:1,

a DNA sequence from nucleotide 1031 to 1867 of SEQ
ID NO:5,

with a sample containing nucleic acid of an
organism, detecting a nucleic acid molecule in said
sample which hybridizes to said probe and isolating
said detected nucleic acid molecule.
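
Claims 132 and 133 refer to probes of at least 15 nucleotides taken from regions identified by nucleotide positions. The Python sketch below is offered only as an illustration under stated assumptions: it shows how such a region could be cut out using 1-based inclusive coordinates, and how the candidate 15-nucleotide windows and their reverse complements could be listed. The helper names and the chosen coordinates are hypothetical, the sample input is simply the first 60 nucleotides printed in Figure 8, and actual hybridization depends on experimental conditions that this sketch does not model.

```python
def subsequence(seq: str, start: int, end: int) -> str:
    """Return nucleotides start..end of seq (1-based, inclusive coordinates)."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]


def probe_windows(region: str, length: int = 15) -> list[str]:
    """List every contiguous window of the given length within region."""
    return [region[i:i + length] for i in range(len(region) - length + 1)]


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, i.e. the strand a probe would pair with."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Sample input: the first 60 nucleotides printed in Figure 8.
sample = (
    "GCAGGCTCGG" "TGCCACCAAT" "ACAGGGCAGC"
    "CCCCTGGCCA" "GAAACACGCC" "CCGTAGTTAC"
)

# Hypothetical region chosen for illustration: nucleotides 11 to 30.
region = subsequence(sample, 11, 30)
windows = probe_windows(region, 15)

for probe in windows:
    print(probe, reverse_complement(probe))
```

A region of n nucleotides yields n - 14 windows of length 15, so the 20-nucleotide region above gives six candidate probes.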

134. A method for the purification of a methyltransferase
capable of catalyzing the conversion of
glycine to dimethyl glycine comprising the steps of
subjecting a sample comprising the methyltransferase
to a matrix containing adenosine,
binding said methyltransferase to said matrix and
eluting said methyltransferase from said matrix.

Figure 8

GCAGGCTCGG TGCCACCAAT ACAGGGCAGC CCCCTGGCCA GAAACACGCC CCGTAGTTAC 60 
GGAGGCGCAT GGCATAGATA ATGAACTTGC TACAAGAGGC ACTTGACATT CAGTTGTCCA 120 
AGCGGCGCCG GCGATCAAGC ACACGATCTG AATTCACAGC GTATGGGTAT ACCGTAGCGC 180 

primer 1

GCGTTCGCTA CACCAACCAT TCGCCAAGCT GAGGTGATAT ATG AAT ACG ACT ACT 235

Met Asn Thr Thr Thr
1 5

→ GSMT

GAG CAG GAC TTC GGA GCG GAC CCA ACA AAA GTC CGA GAT ACC GAC CAC 283
Glu Gln Asp Phe Gly Ala Asp Pro Thr Lys Val Arg Asp Thr Asp His
10 15 20

TAC ACG GAA GAG TAT GTT GAC GGG TTT GTC GAC AAA TGG GAC GAC TTG 331
Tyr Thr Glu Glu Tyr Val Asp Gly Phe Val Asp Lys Trp Asp Asp Leu
25 30 35

ATT GAT TGG GAT AGC CGC GCG AAG AGT GAA GGT GAT TTC TTT ATT CAA 379
Ile Asp Trp Asp Ser Arg Ala Lys Ser Glu Gly Asp Phe Phe Ile Gln
40 45 50

GAG TTC AAG AAG CGT GGT GCC ACA CGC ATC CTC GAT GCC GCA ACA GGT 427
Glu Leu Lys Lys Arg Gly Ala Thr Arg Ile Leu Asp Ala Ala Thr Gly
55 60 65

ACC GGC TTC CAC TCA GTG CGT TTG CTC GAG GCG GGT TTT GAT GTC GTC 475
Thr Gly Phe His Ser Val Arg Leu Leu Glu Ala Gly Phe Asp Val Val
70 75 80 85

AGT GCT GAT GGC AGT GCC GAG ATG CTC GCC AAA GCC TTC GAG AAT GGG 523
Ser Ala Asp Gly Ser Ala Glu Met Leu Ala Lys Ala Phe Glu Asn Gly
90 95 100

CGT AAG CGT GGC CAT ATC CTA CGC ACG GTT CAG GTC GAC TGG CGG TGG 571
Arg Lys Arg Gly His Ile Leu Arg Thr Val Gln Val Asp Trp Arg Trp
105 110 115

CTG AAT CGC GAT ATC CAC GGT CGT TAT GAC GCC ATT ATC TGC CTT GGC 619
Leu Asn Arg Asp Ile His Gly Arg Tyr Asp Ala Ile Ile Cys Leu Gly
120 125 130

AAC TCG TTT ACC CAC CTG TTT AAT GAA AAA GAT CGG CGC AAG ACC CTG 667
Asn Ser Phe Thr His Leu Phe Asn Glu Lys Asp Arg Arg Lys Thr Leu
135 140 145

GCA GAG TTC TAT TCC GCG CTG AAC CCG GAA GGG GTG CTG ATA TTG GAT 715
Ala Glu Phe Tyr Ser Ala Leu Asn Pro Glu Gly Val Leu Ile Leu Asp
150 155 160 165

CAG CGC AAT TAC GAC GGC ATC CTC GAT CAC GGC TAT GAC TCA AGC CAC 763
Gln Arg Asn Tyr Asp Gly Ile Leu Asp His Gly Tyr Asp Ser Ser His
170 175 180

TCC TAC TAC TAT TGC GGT GAG GGC GTC TCT GTC TAT CCT GAG CAC GTT 811
Ser Tyr Tyr Tyr Cys Gly Glu Gly Val Ser Val Tyr Pro Glu His Val
185 190 195

GAT GAT GGC CTA GCC CGC TTC AAG TAT GAA TTC AAC GAC GGT TCA ACT 859
Asp Asp Gly Leu Ala Arg Phe Lys Tyr Glu Phe Asn Asp Gly Ser Thr
200 205 210

TAC TTC CTG AAC ATG TTC CCG CTG CGC AAG GAT TAC ACG CGC CGC CTA 907
Tyr Phe Leu Asn Met Phe Pro Leu Arg Lys Asp Tyr Thr Arg Arg Leu
215 220 225

ATG CAT GAG GTG GGT TTC CAA AAA ATC GAT ACC TAC GGC GAC TTC AAG 955
Met His Glu Val Gly Phe Gln Lys Ile Asp Thr Tyr Gly Asp Phe Lys
230 235 240 245

GCA ACC TAC CGC GAT GCA GAT CCG GAT TTC TTT ATT CAT GTC GCC GAG 1003
Ala Thr Tyr Arg Asp Ala Asp Pro Asp Phe Phe Ile His Val Ala Glu
250 255 260

primer 2 primer 3

AAG GAA TAT CGG GAG GAG GAC TGATAT [partly illegible: ... CGC ... GAT] 1051
Lys Glu Tyr Arg Glu Glu Asp        Met Ala Thr Arg Tyr Asp Asp
            265                    1               5
                     GSMT ←        SDMT

CAA GCC ATA GAA ACG GCG AGG CAG TAC TAT AAC AGT GAG GAT GCC GAT 1099
Gln Ala Ile Glu Thr Ala Arg Gln Tyr Tyr Asn Ser Glu Asp Ala Asp
10 15 20

AAC TTC TAC GCC ATA ATA TGG GGT GGT GAG GAT ATC CAT ATC GGT TTG 1147
Asn Phe Tyr Ala Ile Ile Trp Gly Gly Glu Asp Ile His Ile Gly Leu
25 30 35

TAC AAC GAT GAT GAA GAG CCG ATA GCA GAT GCC AGC AGG CGT ACT GTA 1195
Tyr Asn Asp Asp Glu Glu Pro Ile Ala Asp Ala Ser Arg Arg Thr Val
40 45 55

GAG CGC ATG TCG TCA CTG TCC CGG CAG CTT GGC CCA GAT AGC TAT GTC 1243
Glu Arg Met Ser Ser Leu Ser Arg Gln Leu Gly Pro Asp Ser Tyr Val
60 65 70

CTC GAC ATG GGG GCT GGT TAC GGA GGG TCT GCC CGT TAC CTT GCG CAT 1291
Leu Asp Met Gly Ala Gly Tyr Gly Gly Ser Ala Arg Tyr Leu Ala His
75 80 85

AAG TAT GGC TGT AAG GTG GCG GCG CTC AAT CTC TCC GAA CGT GAG AAT 1339
Lys Tyr Gly Cys Lys Val Ala Ala Leu Asn Leu Ser Glu Arg Glu Asn
90 95 100

GAG CGT GAT CGG CAG ATG AAT AAG GAG CAA GGA GTC GAT CAC CTG ATT 1387
Glu Arg Asp Arg Gln Met Asn Lys Glu Gln Gly Val Asp His Leu Ile
105 110 115

GAG GTT GTC GAT GCT GCC TTT GAA GAT GTC CCC TAT GAT GAT GGG GTG 1435
Glu Val Val Asp Ala Ala Phe Glu Asp Val Pro Tyr Asp Asp Gly Val
120 125 130 135

TTT GAC CTG GTC TGG TCG CAG GAC TCC TTC CTC CAT AGT CCG GAT AGG 1483
Phe Asp Leu Val Trp Ser Gln Asp Ser Phe Leu His Ser Pro Asp Arg
140 145 150

GAA CGC GTA CTG CGG GAG GCT AGT AGA GTT CTG CGT TCT GGG GGC GAG 1531
Glu Arg Val Leu Arg Glu Ala Ser Arg Val Leu Arg Ser Gly Gly Glu
155 160 165

TTT ATC TTC ACC GAT CCA ATG CAG GCG GAT GAT TGC CCC GAG GGG GTT 1579
Phe Ile Phe Thr Asp Pro Met Gln Ala Asp Asp Cys Pro Glu Gly Val
170 175 180

ATT CAA CCG ATC CTC GAT AGG ATC CAC CTC GAG ACG ATG GGC ACG CCT 1627
Ile Gln Pro Ile Leu Asp Arg Ile His Leu Glu Thr Met Gly Thr Pro
185 190 195

AAT TTC TAT CGC CAG ACC CTT AGG GAT CTG GGC TTT GAG GAA ATC ACC 1675

Asn Phe Tyr Arg Gin Thr Leu Arg Asp Leu Gly Phe Glu Glu He Thr 
200 205 210 215 

TTC GAA GAC CAC ACC CAT CAG TTG CCG CGC CAT TAC GGG CGC GTG CGC 1723 
Phe Glu Asp His Thr His Gin Leu Pro Arg His Tyr Gly Arg Val Arg 
220 225 2 30 

CGT GAG CTA GAT CGT CGA GAA GGT GAA CTG CAG GGG CAT GTC TCG GCA 1771 
Arq Glu Leu Asp Arg Arg Glu Gly Glu Leu Gin Gly His Val Ser Ala 
235 240 245 

GAG TAC ATC GAA CGC ATG AAA AAT GGC CTT GAC CAC TGG GTC AAT GGC 1819 
Glu Tyr He Glu Arg Met Lys Asn Gly Leu Asp His Trp Val Asn Gly 
250 ,255 260 

primer 4 

GGT AAC AAG GGT TAT CTC ACC TGG GGT ATC TTT TAC TTC CGC AAA GGG 1867 
Gly Asn Lys Gly Tyr Leu Thr Trp Gly Ile Phe Tyr Phe Arg Lys Gly 

270 275 

TGATCACTAG AGCGCTAAAT CGCAGGCGCG GTAATTGTGC CGCGCTTCGG GCGCATATGT 1927 

TTTAGCCCAG TGGATTTGCA CATAGAAAAG AGTGAATTCT GGTTTAGCGC TCGAATAAAC 1987 

TATAGCAAAG GAGAAACT ATG ACT AAG CGA TAT CTA TTT ACC TCT GAG TCG 2038 

[Met Thr Lys Arg Tyr Leu Phe Thr Ser Glu Ser 

1 5 10 
►SAMS 

GTC TCT GAA GGC CAC CCG GAC AAA ATG GCC GAC CAG ATT TCG GAT GCA 2086 
Val Ser Glu Gly His Pro Asp Lys Met Ala Asp Gln Ile Ser Asp Ala 
15 20 25 

CTG CTC GAT GAG TTC CTG CGC CAG GAT CCA AAG TCG CGG GTG GCT GCA 2134 
Leu Leu Asp Glu Phe Leu Arg Gln Asp Pro Lys Ser Arg Val Ala Ala 
30 35 40 

GAG ACG ATG ATT CAG ACC GGC ATG GTT GTA GTT GCC GGC GAA ATT AAG 2182 
Glu Thr Met Ile Gln Thr Gly Met Val Val Val Ala Gly Glu Ile Lys 
45 50 55 

AGC AAT GCC AAG ATC AAT GTT GAG CCG CTG GTA CGT GAA GTT GTC CGC 2230 
Ser Asn Ala Lys Ile Asn Val Glu Pro Leu Val Arg Glu Val Val Arg 
60 65 70 75 

GAT ATC GGC TAC ACC AGC TCA GAT ATG GGC TTT GAT GCC GAC ACC TGT 2278 
Asp Ile Gly Tyr Thr Ser Ser Asp Met Gly Phe Asp Ala Asp Thr Cys 
80 85 90 

GCC GTA CTC AAC GCC CTC GGC GAG CAG TCC CCC GAC ATC AAT CAA GGC 2326 
Ala Val Leu Asn Ala Leu Gly Glu Gln Ser Pro Asp Ile Asn Gln Gly 
95 100 105 

GTT GAC CGG GAA GAG GAA GAG GAG CAG GGC GCC GGT GAC CAA GGA CTG 2374 
Val Asp Arg Glu Glu Glu Glu Glu Gln Gly Ala Gly Asp Gln Gly Leu 
110 115 120 

ATG TTC GGT TAC GCC ACC AAT GAG ACC GAC GTC CTC ATG CCG GCA GCG 2422 
Met Phe Gly Tyr Ala Thr Asn Glu Thr Asp Val Leu Met Pro Ala Ala 
125 130 135 

ATC CAC TAC TCG CAC CTG CTG GTT AAG CGC CAA TCC GAA GTC CGT AAC 2470 
Ile His Tyr Ser His Leu Leu Val Lys Arg Gln Ser Glu Val Arg Asn 
140 145 150 155 

TCC AAG AAG CTG CCG TGG TTG CGC CCG GAC GCT AAG AGC CAG GTC ACC 2518 
Ser Lys Lys Leu Pro Trp Leu Arg Pro Asp Ala Lys Ser Gln Val Thr 
160 165 170 

TTC AAG TAC GAA GGC GAT AAG ATC GTC GGT TGC GAT GCG GTG GTG CTC 2566 
Phe Lys Tyr Glu Gly Asp Lys Ile Val Gly Cys Asp Ala Val Val Leu 
175 180 185 

TCC ACG CAG CAT GAC GAG ACC GTT GAT CAG AAG ACC GTC CAC GAG GGC 2614 
Ser Thr Gln His Asp Glu Thr Val Asp Gln Lys Thr Val His Glu Gly 
190 195 200 

GTG ATG GAA GAG ATC ATC AAG CCC ATC CTC GGT GAC ACC GGC TGG CTG 2662 
Val Met Glu Glu Ile Ile Lys Pro Ile Leu Gly Asp Thr Gly Trp Leu 
205 210 215 

ACC AAC GAG ACC AAG TAC CAC ATT AAC CCG ACC GGC CGC TTC GTT ACT 2710 
Thr Asn Glu Thr Lys Tyr His Ile Asn Pro Thr Gly Arg Phe Val Thr 
220 225 230 235 

GGT GGT CCG TTG GGT GAC TGT GGC TTG ACC GGC CGT AAG ATC ATC CTC 2758 
Gly Gly Pro Leu Gly Asp Cys Gly Leu Thr Gly Arg Lys Ile Ile Val 
240 245 250 

GAT ACC TAC GGC GGT ATG GGC CGT CAC GGC GGC GGT GCC TTC TCC GGC 2806 
Asp Thr Tyr Gly Gly Met Gly Arg His Gly Gly Gly Ala Phe Ser Gly 
255 260 265 

AAA GAT CCG TCT AAG GTC GAT CGC TCG GCG GCT TAT GTT GGC CGT TAT 2854 
Lys Asp Pro Ser Lys Val Asp Arg Ser Ala Ala Tyr Val Gly Arg Tyr 
270 275 280 

GTA GCT AAG AAC ATA GTG GCT GCC GGT CTG GCC GAC CGC TGT GAG CTG 2902 
Val Ala Lys Asn Ile Val Ala Ala Gly Leu Ala Asp Arg Cys Glu Val 
285 290 295 

CAG CTC TCC TAC GCC ATC GGT GTT GCC GAG CCG ACC TCG GTC AAT GTC 2950 
Gln Leu Ser Tyr Ala Ile Gly Val Ala Glu Pro Thr Ser Val Asn Val 
300 305 310 315 

GAA ACC TTT GGT ACC GGC AAG GTC GAA GAA GAG CTC GCG CAC GAT GGC 2998 
Glu Thr Phe Gly Thr Gly Lys Val Glu Glu Glu Leu Ala His Asp Gly 
320 325 330 

CAG CCC TAAGCCGTGG CCGCTGCGCG AGCCCTGGTA GCCGGCGCGA TCGAACGCCG 3054 
Gln Pro] * 

SAMS◄ 

CAAAGATATG TGCCTGCTGC TCTGGGGGGA TCCCCGGACC GGTATCGCGC ACTTGGAAGG 3114 

TGATTCGGTC ATCGAGCTCC ACCGCGGTGG CGGCCGCTCT AGAACTAGTG GATCCCCCGG 3174 

GCTCCAGGAA TTCGATATCA AGCTTATCGA TACCGTCGAC CTCGA 3219 

AATCTGCGCC GGGATCGGAC GGCAAAGGTG CCGACCGAGT GTGCTGATCG ATGCCAGCGC 60 

GCGGCCCGCT AGTTCGGCTG CCCGCATCGG TCCGGCGTGC CCCTTGTGTG AGGTGACCCG 120 

GCGCGCCCGT GAAAGCTCGG GGCGGCGTAA TTCGCCGTGG TTCCGAGCGA TGCAGTGGCG 180 

primer 7 

CTTGACCAAC GCGAGTGGGA GCTGAGT ATG ACC AAG AGC GTG GAC GAT CTT 231 
Met Thr Lys Ser Val Asp Asp Leu 
1 5 
►GSMT 

GCC CGT GGT GAC CAG GCC GGG GAC GAG CAG GAC CCG GTG CAC CGC GAG 279 
Ala Arg Gly Asp Gln Ala Gly Asp Glu Gln Asp Pro Val His Arg Glu 
10 15 20 

CAG CAG ACG TTC GGC GAC AAT CCG TTG GAA GTA CGC GAC ACT GAT CAC 327 
Gln Gln Thr Phe Gly Asp Asn Pro Leu Glu Val Arg Asp Thr Asp His 
25 30 35 40 

TAC ATG CAT GAG TAC GTC GGT GGT TTT GTC GAC AAG TGG GAC GAT CTG 375 
Tyr Met His Glu Tyr Val Gly Gly Phe Val Asp Lys Trp Asp Asp Leu 
45 50 55 

ATC GAT TGG AAG AAG CGC TAC GAA AGC GAG GGC AGC TTC TTC ATC GAC 423 
Ile Asp Trp Lys Lys Arg Tyr Glu Ser Glu Gly Ser Phe Phe Ile Asp 
60 65 70 

CAA TTG CGC GCA CGC GGT GTC GAG ACC GTG CTG GAC GCG GCG GCC GGG 471 
Gln Leu Arg Ala Arg Gly Val Glu Thr Val Leu Asp Ala Ala Ala Gly 
75 80 85 

ACC GGT TTC CAC TCG GTC CGG TTG CTC GAG GAG GGG TTT GAG ACC GTC 519 
Thr Gly Phe His Ser Val Arg Leu Leu Glu Glu Gly Phe Glu Thr Val 
90 95 100 

AGC GCG GAC GGC AGC CCG CAG ATG CTG GCC AAG GCC TTC AGT AAC GGA 567 
Ser Ala Asp Gly Ser Pro Gln Met Leu Ala Lys Ala Phe Ser Asn Gly 
105 110 115 120 

CTG GCC TAC AAC GGT CAC ATT CTG CGT GTG GTC AAC GCG GAC TGG CGT 615 
Leu Ala Tyr Asn Gly His Ile Leu Arg Val Val Asn Ala Asp Trp Arg 
125 130 135 

TGG CTC AAC CGT GAC GTG CAC GGT GAA TAC GAC GCG ATC ATT TGC CTG 663 
Trp Leu Asn Arg Asp Val His Gly Glu Tyr Asp Ala Ile Ile Cys Leu 
140 145 150 

GGC AAC TCC TTT ACC CAC CTG TTC TCG GAG CGG GAC CGC CGC AAG ACG 711 
Gly Asn Ser Phe Thr His Leu Phe Ser Glu Arg Asp Arg Arg Lys Thr 
155 160 165 

CTG GCT GAG TTC TAC GCG ATG CTC AAG CAC GAC GGT GTC CTG ATC ATC 759 
Leu Ala Glu Phe Tyr Ala Met Leu Lys His Asp Gly Val Leu Ile Ile 
170 175 180 

GAC CAG CGA AAC TAC GAC TCC ATT CTT GAC ACC GGC TTC TCC AGT AAG 807 
Asp Gln Arg Asn Tyr Asp Ser Ile Leu Asp Thr Gly Phe Ser Ser Lys 
185 190 195 200 

CAC ACG TAT TAC TAC GCC GGT GAG GAC GTT TCC GCG GAG CCC GAC CAC 855 
His Thr Tyr Tyr Tyr Ala Gly Glu Asp Val Ser Ala Glu Pro Asp His 
205 210 215 

ATC GAC GAC GGG CTG GCG CGG TTC AAG TAC ACG TTC CCG GAC AAG TCC 903 
Ile Asp Asp Gly Leu Ala Arg Phe Lys Tyr Thr Phe Pro Asp Lys Ser 
220 225 230 

GAA TTC TTC CTG AAC ATG TAC CCG CTG CGG AAA GAC TAC ATG CGG CGG 951 
Glu Phe Phe Leu Asn Met Tyr Pro Leu Arg Lys Asp Tyr Met Arg Arg 
235 240 245 

CTC ATG CGT GAG GTC GGT TTC CAA AGG ATT GAC ACC TAC GGT GAT TTC 999 
Leu Met Arg Glu Val Gly Phe Gln Arg Ile Asp Thr Tyr Gly Asp Phe 
250 255 260 

CAG GAA ACT TAC GGT GAA GAC GAG CCC GAC TTC TAC ATC CAC GTC GCG 1047 
Gln Glu Thr Tyr Gly Glu Asp Glu Pro Asp Phe Tyr Ile His Val Ala 
265 270 275 280 
primer 5 

GAG AAG AGC TAC CGC ACC GAG GAC GAG TTC GTC GAC ATG TAC TCG AAC 1095 
Glu Lys Ser Tyr Arg Thr Glu Asp Glu Phe Val Asp Met Tyr Ser Asn 
1 5 10 15 
►SDMT 

GCG GTG CAC ACC GCG CGG GAC TAC TAC AAC TCC GAG GAC GCG GAC AAC 1143 
Ala Val His Thr Ala Arg Asp Tyr Tyr Asn Ser Glu Asp Ala Asp Asn 
20 25 30 
GSMT◄ 

TTC TAC TAC CAC GTC TGG GGC GGC AAC GAC ATC CAC GTC GGG CTG TAC 1191 
Phe Tyr Tyr His Val Trp Gly Gly Asn Asp Ile His Val Gly Leu Tyr 
35 40 45 

CAG ACA CCG CAG GAG GAC ATC GCC ACC GCC AGT GAG CGC ACT GTC CAG 1239 
Gln Thr Pro Gln Glu Asp Ile Ala Thr Ala Ser Glu Arg Thr Val Gln 
50 55 60 

CGG ATG GCG GGC AAG GTC GAC ATC AGC CCC GAA ACC AGG ATT CTG GAT 1287 
Arg Met Ala Gly Lys Val Asp Ile Ser Pro Glu Thr Arg Ile Leu Asp 
65 70 75 80 

CTC GGT GCC GGC TAC GGC GGA GCC GCG CGG TAC CTG GCC AGG ACC TAC 1335 
Leu Gly Ala Gly Tyr Gly Gly Ala Ala Arg Tyr Leu Ala Arg Thr Tyr 
85 90 95 

GGC TGC CAC GTC ACC TGC CTC AAC CTC AGC GAG GTG GAG AAC CAG CGC 1383 
Gly Cys His Val Thr Cys Leu Asn Leu Ser Glu Val Glu Asn Gln Arg 
100 105 110 

AAC CGC GAG ATC ACT CGC GCC GAG GGG CTC GAG CAC CTG ATC GAG GTG 1431 
Asn Arg Glu Ile Thr Arg Ala Glu Gly Leu Glu His Leu Ile Glu Val 
115 120 125 

ACC GAC GGT TCC TTC GAG GAT CTC CCC TAC CAG GAC AAC GCG TTC GAC 1479 
Thr Asp Gly Ser Phe Glu Asp Leu Pro Tyr Gln Asp Asn Ala Phe Asp 
130 135 140 

GTC GTC TGG TCG CAG GAC TCC TTC CTC CAC AGC GGT GAC CGC AGC AGG 1527 
Val Val Trp Ser Gln Asp Ser Phe Leu His Ser Gly Asp Arg Ser Arg 
145 150 155 160 

GTC ATG GAA GAG GTG ACC CGG GTC CTC AAG CCG AAG GGT TCG GTG CTG 1575 
Val Met Glu Glu Val Thr Arg Val Leu Lys Pro Lys Gly Ser Val Leu 
165 170 175 

TTC ACC GAT CCG ATG GCG TCC GAC TCG GCG AAG AAG AAC GAG CTC GGC 1623 
Phe Thr Asp Pro Met Ala Ser Asp Ser Ala Lys Lys Asn Glu Leu Gly 
180 185 190 
CCC ATC CTG GAC AGG CTG CAC CTG GAC TCG CTC GGC TCG CCC GGT TTC 1671 
Pro Ile Leu Asp Arg Leu His Leu Asp Ser Leu Gly Ser Pro Gly Phe 
195 200 205 

TAC CGG AAG GAG CTG ACT CGT CTC GGG CTG CAG AAC ATC GAG TTC GAG 1719 
Tyr Arg Lys Glu Leu Thr Arg Leu Gly Leu Gln Asn Ile Glu Phe Glu 
210 215 220 

GAC CTC AGC GAA TAC CTG CCC GTC CAC TAC GGC CGG GTT CTG GAA GTG 1767 
Asp Leu Ser Glu Tyr Leu Pro Val His Tyr Gly Arg Val Leu Glu Val 
225 230 235 240 

CTG GAG AGC CGG GAG AAC GAG CTC GCC GGC TTC ATC GGC GAG GAG TAC 1815 
Leu Glu Ser Arg Glu Asn Glu Leu Ala Gly Phe Ile Gly Glu Glu Tyr 
245 250 255 

CGA GCT CAC ATG AAG ACC GGG CTG CGC AAC TGG GTG CAG GCC GGC AAT 1863 
Arg Ala His Met Lys Thr Gly Leu Arg Asn Trp Val Gln Ala Gly Asn 
260 265 270 
primer 6 

GGC GGG AGC CTG GCC TGG GGC ATC ATC CAC GCC AGG GCA TGACAGCCGT 1912 
Gly Gly Ser Leu Ala Trp Gly Ile Ile His Ala Arg Ala * 
275 280 285 
SDMT◄ 

CGCGTACGAA GAACGATCAG AAGCAGTTAG CAGTGAGGTG AAGATCAGCC GTGACTGAGA 1972 
TGAACCGCAG GTTGTTCACC AAGTGAGTCC GTGACCGAGG GCCACCCGGA CAAG ATG 2029 
►SAMS 

GCC GAC TCG ATC AGC GAC GCG ATC CTG GAC GCG ATG CTG GCT CAG GAC 2077 
Ala Asp Ser Ile Ser Asp Ala Ile Leu Asp Ala Met Leu Ala Gln Asp 
5 10 15 

CCC CGC TCC CGC GTG GCC ATG GAG ACC ATG ATC ACC ACC GGG CAG GTG 2125 
Pro Arg Ser Arg Val Ala Met Glu Thr Met Ile Thr Thr Gly Gln Val 
20 25 30 

CAC CTG GCC GGT GAG GTG ACC ACC GAG GCC GAC GTC GAC CTG CCC GCG 2173 
His Leu Ala Gly Glu Val Thr Thr Glu Ala Asp Val Asp Leu Pro Ala 
35 40 45 

ATC GTG CGG GAG AAG GTC CTC GAG ATC GGC TAC GAC AAC TCG GCC AAG 2221 
Ile Val Arg Glu Lys Val Leu Glu Ile Gly Tyr Asp Asn Ser Ala Lys 
50 55 60 

GGC TTC GAC GGA GAC TCC TGC GGC ATC AAC GTC TCC ATC GAC GCG CAG 2269 
Gly Phe Asp Gly Asp Ser Cys Gly Ile Asn Val Ser Ile Asp Ala Gln 
70 75 80 

TCC CCG GAC ATC GGC CAG GGC GTG GAC TCC GCT CAC GAG TCC CGC GTC 2317 
Ser Pro Asp Ile Gly Gln Gly Val Asp Ser Ala His Glu Ser Arg Val 
85 90 95 

GAG GGT GCC ATC GAC GAG ATC GCC AGT CAG GGC GCC GGC GAC CAG GGC 2365 
Glu Gly Ala Ile Asp Glu Ile Ala Ser Gln Gly Ala Gly Asp Gln Gly 
100 105 110 

CTG ATG TTC GGT TAC GCC ACC AGC GAG ACC GAC GAG CTC ATG CCG CTG 2413 
Leu Met Phe Gly Tyr Ala Thr Ser Glu Thr Asp Glu Leu Met Pro Leu 
115 120 125 

CCG ATC GCG TTG GCC CAC CGC ATG TCG CGT CGA CTG ACC CGC GTG CGC 2461 
Pro Ile Ala Leu Ala His Arg Met Ser Arg Arg Leu Thr Arg Val Arg 
130 135 140 145 
AAC GAC GGC ACG CTG CCG TAC CTG CGT GCC GAC GGC AAG ACC CAG GTC 2509 
Asn Asp Gly Thr Leu Pro Tyr Leu Arg Ala Asp Gly Lys Thr Gln Val 
150 155 160 

ACC GTC GAG TAC GCC GGT GAC CAG CCG GTT CGC CTG GAC ACC ACG GTG 2557 
Thr Val Glu Tyr Ala Gly Asp Gln Pro Val Arg Leu Asp Thr Thr Val 
165 170 175 

CTG TCC AGC CAG CAC GCC GAG GAC GTC GAC CTC GAC AAG CAA CTG ATC 2605 
Leu Ser Ser Gln His Ala Glu Asp Val Asp Leu Asp Lys Gln Leu Ile 
180 185 190 

CCC GAG GTC AGG GAC AAG GTC ATC ACC CCG GAG ATC GAG AAG GTC GGG 2653 
Pro Glu Val Arg Asp Lys Val Ile Thr Pro Glu Ile Glu Lys Val Gly 
195 200 205 

CTG GAC ACC TCG GAC ATG CGT CTG CTG GTG AAT CCG ACG GGT CGG TTC 2701 
Leu Asp Thr Ser Asp Met Arg Leu Leu Val Asn Pro Thr Gly Arg Phe 
210 215 220 225 

GTC ACG GGT GGT CCG ATG GGG TGACTGCGGG CTGACCGGCC GCAAAGATCA 2752 
Val Thr Gly Gly Pro Met Gly * 
230 
SAMS◄ 

TCGTCGACAC CTACGGCGGG ATGGCCCCGC CACGGTGGCG GTGCCTCTCC GGTAAAGGAC 2812 
CT 2814 

Fig. 10 

Fig. 12 

SEQUENCE LISTING 

<110> Reinikainen, Tapani 
Nyyssölä, Antti 
Kerovuo, Janne 

<120> Methyltransferases, nucleic acid molecules encoding 
methyltransferases, their recombinant expression and 
uses thereof 

<130> 73695 

<140> 
<141> 

<160> 30 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 2814 
<212> DNA 
<213> Actinopolyspora halophila 

<220> 
<221> CDS 
<222> (208)..(1047) 

<220> 
<221> CDS 
<222> (1048)..(1902) 

<220> 
<221> CDS 
<222> (2027)..(2722) 
<400> 1 

aatctgcgcc gggatcggac ggcaaaggtg ccgaccgagt gtgctgatcg atgccagcgc 60 
gcggcccgct agttcggctg cccgcatcgg tccggcgtgc cccttgtgtg aggtgacccg 120 
gcgcgcccgt gaaagctcgg ggcggcgtaa ttcgccgtgg ttccgagcga tgcagtggcg 180 

cttgaccaac gcgagtggga gctgagt atg acc aag agc gtg gac gat ctt gcc 234 
Met Thr Lys Ser Val Asp Asp Leu Ala 
1 5 

cgt ggt gac cag gcc ggg gac gag cag gac ccg gtg cac cgc gag cag 282 
Arg Gly Asp Gln Ala Gly Asp Glu Gln Asp Pro Val His Arg Glu Gln 
10 15 20 25 

cag acg ttc ggc gac aat ccg ttg gaa gta cgc gac act gat cac tac 330 
Gln Thr Phe Gly Asp Asn Pro Leu Glu Val Arg Asp Thr Asp His Tyr 
30 35 40 

atg cat gag tac gtc ggt ggt ttt gtc gac aag tgg gac gat ctg atc 378 
Met His Glu Tyr Val Gly Gly Phe Val Asp Lys Trp Asp Asp Leu Ile 
45 50 55 

gat tgg aag aag cgc tac gaa agc gag ggc agc ttc ttc atc gac caa 426 
Asp Trp Lys Lys Arg Tyr Glu Ser Glu Gly Ser Phe Phe Ile Asp Gln 
60 65 70 

ttg cgc gca cgc ggt gtc gag acc gtg ctg gac gcg gcg gcc ggg acc 474 
Leu Arg Ala Arg Gly Val Glu Thr Val Leu Asp Ala Ala Ala Gly Thr 
75 80 85 

ggt ttc cac tcg gtc cgg ttg ctc gag gag ggg ttt gag acc gtc agc 522 
Gly Phe His Ser Val Arg Leu Leu Glu Glu Gly Phe Glu Thr Val Ser 
90 95 100 105 
gcg gac ggc agc ccg cag atg ctg gcc aag gcc ttc agt aac gga ctg 570 
Ala Asp Gly Ser Pro Gln Met Leu Ala Lys Ala Phe Ser Asn Gly Leu 
110 115 120 

gcc tac aac ggt cac att ctg cgt gtg gtc aac gcg gac tgg cgt tgg 618 
Ala Tyr Asn Gly His Ile Leu Arg Val Val Asn Ala Asp Trp Arg Trp 
125 130 135 

ctc aac cgt gac gtg cac ggt gaa tac gac gcg atc att tgc ctg ggc 666 
Leu Asn Arg Asp Val His Gly Glu Tyr Asp Ala Ile Ile Cys Leu Gly 
140 145 150 

aac tcc ttt acc cac ctg ttc tcg gag cgg gac cgc cgc aag acg ctg 714 
Asn Ser Phe Thr His Leu Phe Ser Glu Arg Asp Arg Arg Lys Thr Leu 
155 160 165 

gct gag ttc tac gcg atg ctc aag cac gac ggt gtc ctg atc atc gac 762 
Ala Glu Phe Tyr Ala Met Leu Lys His Asp Gly Val Leu Ile Ile Asp 
170 175 180 185 

cag cga aac tac gac tcc att ctt gac acc ggc ttc tcc agt aag cac 810 
Gln Arg Asn Tyr Asp Ser Ile Leu Asp Thr Gly Phe Ser Ser Lys His 
190 195 200 

acg tat tac tac gcc ggt gag gac gtt tcc gcg gag ccc gac cac atc 858 
Thr Tyr Tyr Tyr Ala Gly Glu Asp Val Ser Ala Glu Pro Asp His Ile 
205 210 215 

gac gac ggg ctg gcg cgg ttc aag tac acg ttc ccg gac aag tcc gaa 906 
Asp Asp Gly Leu Ala Arg Phe Lys Tyr Thr Phe Pro Asp Lys Ser Glu 
220 225 230 

ttc ttc ctg aac atg tac ccg ctg cgg aaa gac tac atg cgg cgg ctc 954 
Phe Phe Leu Asn Met Tyr Pro Leu Arg Lys Asp Tyr Met Arg Arg Leu 
235 240 245 

atg cgt gag gtc ggt ttc caa agg att gac acc tac ggt gat ttc cag 1002 
Met Arg Glu Val Gly Phe Gln Arg Ile Asp Thr Tyr Gly Asp Phe Gln 
250 255 260 265 

gaa act tac ggt gaa gac gag ccc gac ttc tac atc cac gtc gcg gag 1050 
Glu Thr Tyr Gly Glu Asp Glu Pro Asp Phe Tyr Ile His Val Ala Glu 
270 275 280 1 

aag agc tac cgc acc gag gac gag ttc gtc gac atg tac tcg aac gcg 1098 
Lys Ser Tyr Arg Thr Glu Asp Glu Phe Val Asp Met Tyr Ser Asn Ala 
5 10 15 

gtg cac acc gcg cgg gac tac tac aac tcc gag gac gcg gac aac ttc 1146 
Val His Thr Ala Arg Asp Tyr Tyr Asn Ser Glu Asp Ala Asp Asn Phe 
20 25 30 

tac tac cac gtc tgg ggc ggc aac gac atc cac gtc ggg ctg tac cag 1194 
Tyr Tyr His Val Trp Gly Gly Asn Asp Ile His Val Gly Leu Tyr Gln 
35 40 45 

aca ccg cag gag gac atc gcc acc gcc agt gag cgc act gtc cag cgg 1242 
Thr Pro Gln Glu Asp Ile Ala Thr Ala Ser Glu Arg Thr Val Gln Arg 
50 55 60 65 

atg gcg ggc aag gtc gac atc agc ccc gaa acc agg att ctg gat ctc 1290 
Met Ala Gly Lys Val Asp Ile Ser Pro Glu Thr Arg Ile Leu Asp Leu 
70 75 80 

ggt gcc ggc tac ggc gga gcc gcg cgg tac ctg gcc agg acc tac ggc 1338 
Gly Ala Gly Tyr Gly Gly Ala Ala Arg Tyr Leu Ala Arg Thr Tyr Gly 
85 90 95 
tgc cac gtc acc tgc ctc aac ctc agc gag gtg gag aac cag cgc aac 1386 
Cys His Val Thr Cys Leu Asn Leu Ser Glu Val Glu Asn Gln Arg Asn 
100 105 110 

cgc gag atc act cgc gcc gag ggg ctc gag cac ctg atc gag gtg acc 1434 
Arg Glu Ile Thr Arg Ala Glu Gly Leu Glu His Leu Ile Glu Val Thr 
115 120 125 

gac ggt tcc ttc gag gat ctc ccc tac cag gac aac gcg ttc gac gtg 1482 
Asp Gly Ser Phe Glu Asp Leu Pro Tyr Gln Asp Asn Ala Phe Asp Val 
130 135 140 145 

gtc tgg tcg cag gac tcc ttc ctc cac agc ggt gac cgc agc agg gtc 1530 
Val Trp Ser Gln Asp Ser Phe Leu His Ser Gly Asp Arg Ser Arg Val 
150 155 160 

atg gaa gag gtg acc cgg gtc ctc aag ccg aag ggt tcg gtg ctg ttc 1578 
Met Glu Glu Val Thr Arg Val Leu Lys Pro Lys Gly Ser Val Leu Phe 
165 170 175 

acc gat ccg atg gcg tcc gac tcg gcg aag aag aac gag ctc ggc ccc 1626 
Thr Asp Pro Met Ala Ser Asp Ser Ala Lys Lys Asn Glu Leu Gly Pro 
180 185 190 

atc ctg gac agg ctg cac ctg gac tcg ctc ggc tcg ccc ggt ttc tac 1674 
Ile Leu Asp Arg Leu His Leu Asp Ser Leu Gly Ser Pro Gly Phe Tyr 
195 200 205 

cgg aag gag ctg act cgt ctc ggg ctg cag aac atc gag ttc gag gac 1722 
Arg Lys Glu Leu Thr Arg Leu Gly Leu Gln Asn Ile Glu Phe Glu Asp 
210 215 220 225 

ctc agc gaa tac ctg ccc gtc cac tac ggc cgg gtt ctg gaa gtg ctg 1770 
Leu Ser Glu Tyr Leu Pro Val His Tyr Gly Arg Val Leu Glu Val Leu 
230 235 240 

gag agc cgg gag aac gag ctc gcc ggc ttc atc ggc gag gag tac cga 1818 
Glu Ser Arg Glu Asn Glu Leu Ala Gly Phe Ile Gly Glu Glu Tyr Arg 
245 250 255 

gct cac atg aag acc ggg ctg cgc aac tgg gtg cag gcc ggc aat ggc 1866 
Ala His Met Lys Thr Gly Leu Arg Asn Trp Val Gln Ala Gly Asn Gly 
260 265 270 

ggg agc ctg gcc tgg ggc atc atc cac gcc agg gca tgacagccgt 1912 
Gly Ser Leu Ala Trp Gly Ile Ile His Ala Arg Ala 
275 280 285 

cgcgtacgaa gaacgatcag aagcagttag cagtgaggtg aagatcagcc gtgactgaga 1972 

tgaaccgcag gttgttcacc aagtgagtcc gtgaccgagg gccacccgga caag atg 2029 
Met 
1 

gcc gac tcg atc agc gac gcg atc ctg gac gcg atg ctg gct cag gac 2077 
Ala Asp Ser Ile Ser Asp Ala Ile Leu Asp Ala Met Leu Ala Gln Asp 
5 10 15 

ccc cgc tcc cgc gtg gcc atg gag acc atg atc acc acc ggg cag gtg 2125 
Pro Arg Ser Arg Val Ala Met Glu Thr Met Ile Thr Thr Gly Gln Val 
20 25 30 

cac ctg gcc ggt gag gtg acc acc gag gcc gac gtc gac ctg ccc gcg 2173 
His Leu Ala Gly Glu Val Thr Thr Glu Ala Asp Val Asp Leu Pro Ala 
35 40 45 

atc gtg cgg gag aag gtc ctc gag atc ggc tac gac aac tcg gcc aag 2221 
Ile Val Arg Glu Lys Val Leu Glu Ile Gly Tyr Asp Asn Ser Ala Lys 
50 55 60 65 

ggc ttc gac gga gac tcc tgc ggc atc aac gtc tcc atc gac gcg cag 2269 
Gly Phe Asp Gly Asp Ser Cys Gly Ile Asn Val Ser Ile Asp Ala Gln 
70 75 80 

tcc ccg gac atc ggc cag ggc gtg gac tcc gct cac gag tcc cgc gtc 2317 
Ser Pro Asp Ile Gly Gln Gly Val Asp Ser Ala His Glu Ser Arg Val 
85 90 95 

gag ggt gcc atc gac gag atc gcc agt cag ggc gcc ggc gac cag ggc 2365 
Glu Gly Ala Ile Asp Glu Ile Ala Ser Gln Gly Ala Gly Asp Gln Gly 
100 105 110 

ctg atg ttc ggt tac gcc acc agc gag acc gac gag ctc atg ccg ctg 2413 
Leu Met Phe Gly Tyr Ala Thr Ser Glu Thr Asp Glu Leu Met Pro Leu 
115 120 125 

ccg atc gcg ttg gcc cac cgc atg tcg cgt cga ctg acc cgc gtg cgc 2461 
Pro Ile Ala Leu Ala His Arg Met Ser Arg Arg Leu Thr Arg Val Arg 
130 135 140 145 

aac gac ggc acg ctg ccg tac ctg cgt gcc gac ggc aag acc cag gtc 2509 
Asn Asp Gly Thr Leu Pro Tyr Leu Arg Ala Asp Gly Lys Thr Gln Val 
150 155 160 

acc gtc gag tac gcc ggt gac cag ccg gtt cgc ctg gac acc acg gtg 2557 
Thr Val Glu Tyr Ala Gly Asp Gln Pro Val Arg Leu Asp Thr Thr Val 
165 170 175 

ctg tcc agc cag cac gcc gag gac gtc gac ctc gac aag caa ctg atc 2605 
Leu Ser Ser Gln His Ala Glu Asp Val Asp Leu Asp Lys Gln Leu Ile 
180 185 190 
ccc gag gtc agg gac aag gtc atc acc ccg gag atc gag aag gtc ggg 2653 
Pro Glu Val Arg Asp Lys Val Ile Thr Pro Glu Ile Glu Lys Val Gly 
195 200 205 

ctg gac acc tcg gac atg cgt ctg ctg gtg aat ccg acg ggt cgg ttc 2701 
Leu Asp Thr Ser Asp Met Arg Leu Leu Val Asn Pro Thr Gly Arg Phe 
210 215 220 225 

gtc acg ggt ggt ccg atg ggg tgactgcggg ctgaccggcc gcaaagatca 2752 
Val Thr Gly Gly Pro Met Gly 
230 

tcgtcgacac ctacggcggg atggccccgc cacggtggcg gtgcctctcc ggtaaaggac 2812 

ct 2814 



<210> 2 
<211> 280 
<212> PRT 
<213> Actinopolyspora halophila 

<400> 2 

Met Thr Lys Ser Val Asp Asp Leu Ala Arg Gly Asp Gln Ala Gly Asp 
1 5 10 15 

Glu Gln Asp Pro Val His Arg Glu Gln Gln Thr Phe Gly Asp Asn Pro 
20 25 30 

Leu Glu Val Arg Asp Thr Asp His Tyr Met His Glu Tyr Val Gly Gly 
35 40 45 

Phe Val Asp Lys Trp Asp Asp Leu Ile Asp Trp Lys Lys Arg Tyr Glu 
50 55 60 

Ser Glu Gly Ser Phe Phe Ile Asp Gln Leu Arg Ala Arg Gly Val Glu 
65 70 75 80 

Thr Val Leu Asp Ala Ala Ala Gly Thr Gly Phe His Ser Val Arg Leu 
85 90 95 

Leu Glu Glu Gly Phe Glu Thr Val Ser Ala Asp Gly Ser Pro Gln Met 
100 105 110 

Leu Ala Lys Ala Phe Ser Asn Gly Leu Ala Tyr Asn Gly His Ile Leu 
115 120 125 

Arg Val Val Asn Ala Asp Trp Arg Trp Leu Asn Arg Asp Val His Gly 
130 135 140 

Glu Tyr Asp Ala Ile Ile Cys Leu Gly Asn Ser Phe Thr His Leu Phe 
145 150 155 160 

Ser Glu Arg Asp Arg Arg Lys Thr Leu Ala Glu Phe Tyr Ala Met Leu 
165 170 175 

Lys His Asp Gly Val Leu Ile Ile Asp Gln Arg Asn Tyr Asp Ser Ile 
180 185 190 

Leu Asp Thr Gly Phe Ser Ser Lys His Thr Tyr Tyr Tyr Ala Gly Glu 
195 200 205 

Asp Val Ser Ala Glu Pro Asp His Ile Asp Asp Gly Leu Ala Arg Phe 
210 215 220 

Lys Tyr Thr Phe Pro Asp Lys Ser Glu Phe Phe Leu Asn Met Tyr Pro 
225 230 235 240 
Leu Arg Lys Asp Tyr Met Arg Arg Leu Met Arg Glu Val Gly Phe Gln 
245 250 255 

Arg Ile Asp Thr Tyr Gly Asp Phe Gln Glu Thr Tyr Gly Glu Asp Glu 
260 265 270 

Pro Asp Phe Tyr Ile His Val Ala 
275 280 

<210> 3 
<211> 285 
<212> PRT 
<213> Actinopolyspora halophila 

<400> 3 

Glu Lys Ser Tyr Arg Thr Glu Asp Glu Phe Val Asp Met Tyr Ser Asn 
1 5 10 15 

Ala Val His Thr Ala Arg Asp Tyr Tyr Asn Ser Glu Asp Ala Asp Asn 
20 25 30 

Phe Tyr Tyr His Val Trp Gly Gly Asn Asp Ile His Val Gly Leu Tyr 
35 40 45 

Gln Thr Pro Gln Glu Asp Ile Ala Thr Ala Ser Glu Arg Thr Val Gln 
50 55 60 

Arg Met Ala Gly Lys Val Asp Ile Ser Pro Glu Thr Arg Ile Leu Asp 
65 70 75 80 

Leu Gly Ala Gly Tyr Gly Gly Ala Ala Arg Tyr Leu Ala Arg Thr Tyr 
85 90 95 
Gly Cys His Val Thr Cys Leu Asn Leu Ser Glu Val Glu Asn Gln Arg 
100 105 110 

Asn Arg Glu Ile Thr Arg Ala Glu Gly Leu Glu His Leu Ile Glu Val 
115 120 125 

Thr Asp Gly Ser Phe Glu Asp Leu Pro Tyr Gln Asp Asn Ala Phe Asp 
130 135 140 

Val Val Trp Ser Gln Asp Ser Phe Leu His Ser Gly Asp Arg Ser Arg 
145 150 155 160 

Val Met Glu Glu Val Thr Arg Val Leu Lys Pro Lys Gly Ser Val Leu 
165 170 175 

Phe Thr Asp Pro Met Ala Ser Asp Ser Ala Lys Lys Asn Glu Leu Gly 
180 185 190 

Pro Ile Leu Asp Arg Leu His Leu Asp Ser Leu Gly Ser Pro Gly Phe 
195 200 205 

Tyr Arg Lys Glu Leu Thr Arg Leu Gly Leu Gln Asn Ile Glu Phe Glu 
210 215 220 

Asp Leu Ser Glu Tyr Leu Pro Val His Tyr Gly Arg Val Leu Glu Val 
225 230 235 240 

Leu Glu Ser Arg Glu Asn Glu Leu Ala Gly Phe Ile Gly Glu Glu Tyr 
245 250 255 

Arg Ala His Met Lys Thr Gly Leu Arg Asn Trp Val Gln Ala Gly Asn 
260 265 270 
Gly Gly Ser Leu Ala Trp Gly Ile Ile His Ala Arg Ala 
275 280 285 

<210> 4 
<211> 232 
<212> PRT 
<213> Actinopolyspora halophila 

<400> 4 

Met Ala Asp Ser Ile Ser Asp Ala Ile Leu Asp Ala Met Leu Ala Gln 
1 5 10 15 

Asp Pro Arg Ser Arg Val Ala Met Glu Thr Met Ile Thr Thr Gly Gln 
20 25 30 

Val His Leu Ala Gly Glu Val Thr Thr Glu Ala Asp Val Asp Leu Pro 
35 40 45 

Ala Ile Val Arg Glu Lys Val Leu Glu Ile Gly Tyr Asp Asn Ser Ala 
50 55 60 

Lys Gly Phe Asp Gly Asp Ser Cys Gly Ile Asn Val Ser Ile Asp Ala 
65 70 75 80 

Gln Ser Pro Asp Ile Gly Gln Gly Val Asp Ser Ala His Glu Ser Arg 
85 90 95 

Val Glu Gly Ala Ile Asp Glu Ile Ala Ser Gln Gly Ala Gly Asp Gln 
100 105 110 

Gly Leu Met Phe Gly Tyr Ala Thr Ser Glu Thr Asp Glu Leu Met Pro 
115 120 125 



WO 00/11142 



13 



PCT/EP99/06037 



Leu Pro Ile Ala Leu Ala His Arg Met Ser Arg Arg Leu Thr Arg Val 
130 135 140 

Arg Asn Asp Gly Thr Leu Pro Tyr Leu Arg Ala Asp Gly Lys Thr Gln 
145 150 155 160 

Val Thr Val Glu Tyr Ala Gly Asp Gln Pro Val Arg Leu Asp Thr Thr 
165 170 175 

Val Leu Ser Ser Gln His Ala Glu Asp Val Asp Leu Asp Lys Gln Leu 
180 185 190 

Ile Pro Glu Val Arg Asp Lys Val Ile Thr Pro Glu Ile Glu Lys Val 
195 200 205 

Gly Leu Asp Thr Ser Asp Met Arg Leu Leu Val Asn Pro Thr Gly Arg 
210 215 220 

Phe Val Thr Gly Gly Pro Met Gly 
225 230 



<210> 5 
<211> 3219 
<212> DNA 
<213> Ectothiorhodospira halochloris 

<220> 
<221> CDS 
<222> (221)..(1024) 

<220> 
<221> CDS 
<222> (1031)..(1867) 

<220> 
<221> CDS 
<222> (2006)..(3004) 

<400> 5 

gcaggctcgg tgccaccaat acagggcagc cccctggcca gaaacacgcc ccgtagttac 60 

ggaggcgcat gccatagata atgaacttgc tacaagaggc acttgacatt cagttgtcca 120 

agcggcgccg gcgatcaagc acacgatctg aattcacagc gtatgggtat accgtagcgc 180 

gcgttcgcta caccaaccat tcgccaagct gaggtgatat atg aat acg act act 235 
Met Asn Thr Thr Thr 
1 5 

gag cag gac ttc gga gcg gac cca aca aaa gtc cga gat acc gac cac 283 
Glu Gln Asp Phe Gly Ala Asp Pro Thr Lys Val Arg Asp Thr Asp His 
10 15 20 

tac acg gaa gag tat gtt gac ggg ttt gtc gac aaa tgg gac gac ttg 331 
Tyr Thr Glu Glu Tyr Val Asp Gly Phe Val Asp Lys Trp Asp Asp Leu 
25 30 35 

att gat tgg gat agc cgc gcg aag agt gaa ggt gat ttc ttt att caa 379 
Ile Asp Trp Asp Ser Arg Ala Lys Ser Glu Gly Asp Phe Phe Ile Gln 
40 45 50 

gag ttg aag aag cgt ggt gcc aca cgc atc ctc gat gcc gca aca ggt 427 
Glu Leu Lys Lys Arg Gly Ala Thr Arg Ile Leu Asp Ala Ala Thr Gly 
55 60 65 

acc ggc ttc cac tca gtg cgt ttg ctc gag gcg ggt ttt gat gtc gtc 475 
Thr Gly Phe His Ser Val Arg Leu Leu Glu Ala Gly Phe Asp Val Val 


70 75 80 85 

agt gct gat ggc agt gcc gag atg ctc gcc aaa gcc ttc gag aat ggg 523 
Ser Ala Asp Gly Ser Ala Glu Met Leu Ala Lys Ala Phe Glu Asn Gly 
90 95 100 

cgt aag cgt ggc cat atc cta cgc acg gtt cag gtc gac tgg cgg tgg 571 
Arg Lys Arg Gly His Ile Leu Arg Thr Val Gln Val Asp Trp Arg Trp 
105 110 115 

ctg aat cgc gat atc cac ggt cgt tat gac gcc att atc tgc ctt ggc 619 
Leu Asn Arg Asp Ile His Gly Arg Tyr Asp Ala Ile Ile Cys Leu Gly 
120 125 130 

aac tcg ttt acc cac ctg ttt aat gaa aaa gat cgg cgc aag acc ctg 667 
Asn Ser Phe Thr His Leu Phe Asn Glu Lys Asp Arg Arg Lys Thr Leu 
135 140 145 

gca gag ttc tat t« gcg ctg aac ccg gaa ggg gtg ctg ata ttg gat 715 
Ala Glu Phe Tyr Ser Ala Leu Asn Pro Glu Gly Val Leu Ile Leu Asp 
150 155 160 165 

cag cgc aat tac gac ggc atc ctc gat cac ggc tat gac tca agc cac 763 
Gln Arg Asn Tyr Asp Gly Ile Leu Asp His Gly Tyr Asp Ser Ser His 
170 175 180 

tcc tac tac tat tgc ggt gag ggc gtc tct gtc tat cct gag cac gtt 811 
Ser Tyr Tyr Tyr Cys Gly Glu Gly Val Ser Val Tyr Pro Glu His Val 
185 190 195 

gat gat ggc cta gcc cgc ttc aag tat gaa ttc aac gac ggt tca act 859 
Asp Asp Gly Leu Ala Arg Phe Lys Tyr Glu Phe Asn Asp Gly Ser Thr 
200 205 210 
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907 



955 



1003 



1051 



1099 



tac ttc ctg aac atg ttc ccg ctg cgc aag gat tac acg cgc cgc eta 
Tyr Phe Leu Asn Met Phe Pro Leu Arg Lys Asp. Tyr Thr Arg Arg Leu 
215 220 225 

atg cat gag gtg ggt ttc caa aaa ate gat acc tac ggc gac ttc aag 
Met His Glu Val Gly Phe Gin Lys lie Asp Thr Tyr Gly Asp Phe Lys 
230 235 240 245 

gca acc tac cgc gat gca gat ccg gat ttc ttt att cat gtc gec gag 
Ala Thr Tyr Arg Asp Ala Asp Pro Asp Phe Phe He His Val Ala Glu 
250 255 260 

aag gaa tat egg gag gag gac tgatat atg gcg acg cgc tac gac gat 
Lys Glu Tyr Arg Glu Glu Asp Met Ala Thr Arg Tyr Asp Asp 

265 1 " 5 

caa gec ata gaa acg gcg agg cag tac tat aac agt gag gat gec gat 
Gin Ala He Glu Thr Ala Arg Gin Tyr Tyr Asn Ser Glu Asp Ala Asp 
10 15 20 

aac ttc tac gec ata ata tgg ggt ggt gag gat ate cat ate ggt ttg 
Asn Phe Tyr Ala He He Trp Gly Gly Glu Asp He His He Gly Leu 
25 30 35 . 

tac aac gat gat gaa gag ccg ata gca gat gee age agg cgt act gta 
Tyr Asn Asp Asp Glu Glu Pro He Ala Asp Ala Ser Arg Arg Thr Val 
40 45 50 55 

gag cgc atg teg tea ctg tec egg cag ctt ggc cca gat age tat gtc 
Glu Arg Met Ser Ser Leu Ser Arg Gin Leu Gly Pro Asp Ser Tyr Val 
60 65 70 



etc gac atg ggg get ggt tac gga ggg tct gec cgt tac ctt gcg cat 1291 
Leu Asp Met Gly Ala Gly Tyr Gly Gly Ser Ala Arg Tyr Leu Ala His 



1147 



1195 



1243 






75 



80 85 



aag tat ggc tgt aag gtg gcg gcg etc aat etc tec gaa cgt gag aat 
Lys Tyr Gly Cys Lys Val Ala Ala Leu Asn Leu Ser Glu Arg Glu Asn 
90 95 100 . 

gag cgt gat egg cag atg aat aag gag caa gga gtc gat cac ctg att 
Glu Arg Asp Arg Gin Met Asn Lys Glu Gin Gly Val Asp His Leu lie 
105 110 , 115 

gag gtt gtc gat get gec ttt gaa gat gtc ccc tat gat gat ggg gtg 
Glu Val Val Asp Ala Ala Phe Glu Asp Val Pro Tyr Asp Asp Gly Val 
120 



125 130 135 



att caa ccg ate etc gat agg ate cac etc gag acg atg ggc acg cct 
He Gin Pro lie Leu Asp Arg He His Leu Glu Thr Met Gly Thr Pro 
186 190 195 



1339 



13 87 



1435 



1483 



1531 



ttt gac ctg gtc tgg teg cag gac tec ttc etc cat agt ccg gat agg 
Phe Asp Leu Val Trp Ser Gin Asp Ser Phe Leu His Ser Pro Asp Arg 
140 " 145 150 

gaa cgc gta ctg egg gag get agt aga gtt ctg cgt tct ggg ggc gag 
Glu Arg Val Leu Arg Glu Ala Ser Arg Val Leu Arg Ser Gly Gly Glu 
155 160 . .165 



ttt ate ttc ace gat cca atg cag gcg gat gat tgc ccc gag ggg gtt 1579. 
Phe He Phe Thr Asp Pro Met Gin Ala Asp Asp Cys Pro Glu Gly Val 
170 ' 175 180 



1627 



aat ttc tat cgc 
Asn. Phe Tyr Arg 
200 



cag acc ctt agg gat ctg ggc ttt 
Gin Thr Leu Arg Asp Leu Gly Phe 
205 210 



gag gaa ate acc 
Glu Glu He Thr 
215 



1675 



ttc gaa gac cac acc cat cag ttg ccg cgc cat tac ggg cgc gtg cgc 1723
Phe Glu Asp His Thr His Gln Leu Pro Arg His Tyr Gly Arg Val Arg
220 225 230

cgt gag cta gat cgt cga gaa ggt gaa ctg cag ggg cat gtc tcg gca 1771
Arg Glu Leu Asp Arg Arg Glu Gly Glu Leu Gln Gly His Val Ser Ala
235 240 245

gag tac atc gaa cgc atg aaa aat ggc c» gac cac tgg gtc aat ggc 1819
Glu Tyr Ile Glu Arg Met Lys Asn Gly Leu Asp His Trp Val Asn Gly
250 255 260

ggt aac aag ggt tat ctc acc tgg ggt atc ttt tac ttc cgc aaa ggg 1867
Gly Asn Lys Gly Tyr Leu Thr Trp Gly Ile Phe Tyr Phe Arg Lys Gly
265 270 275

tgatcactag agcgctaaat cgcaggcgcg gtaattgtgc cgcgcttcgg gcgcatatgt 1927
tttagcccag tggatttgca catagaaaag agtgaattct ggtttagcgc tcgaataaac 1987

tatagcaaag gagaaact atg act aag cga tat cta ttt acc tct gag tcg 2038
Met Thr Lys Arg Tyr Leu Phe Thr Ser Glu Ser
1 5 10

gtc tct gaa ggc cac ccg gac aaa atg gcc gac cag att tcg gat gca 2086
Val Ser Glu Gly His Pro Asp Lys Met Ala Asp Gln Ile Ser Asp Ala
15 20 25

ctg ctc gat gag ttc ctg cgc cag gat cca aag tcg cgg gtg gct gca 2134
Leu Leu Asp Glu Phe Leu Arg Gln Asp Pro Lys Ser Arg Val Ala Ala
30 35 40

gag acg atg att cag acc ggc atg gtt gta gtt gcc ggc gaa att aag 2182
Glu Thr Met Ile Gln Thr Gly Met Val Val Val Ala Gly Glu Ile Lys
45 50 55

agc aat gcc aag atc aat gtt gag ccg ctg gta cgt gaa gtt gtc cgc 2230
Ser Asn Ala Lys Ile Asn Val Glu Pro Leu Val Arg Glu Val Val Arg
60 65 70 75

gat atc ggc tac acc agc tca gat atg ggc ttt gat gcc gac acc tgt 2278
Asp Ile Gly Tyr Thr Ser Ser Asp Met Gly Phe Asp Ala Asp Thr Cys
80 85 90

gcc gta ctc aac gcc ctc ggc gag cag tcc ccc gac atc aat caa ggc 2326
Ala Val Leu Asn Ala Leu Gly Glu Gln Ser Pro Asp Ile Asn Gln Gly
95 100 105

gtt gac cgg gaa gag gaa gag gag cag ggc gcc ggt gac caa gga ctg 2374
Val Asp Arg Glu Glu Glu Glu Glu Gln Gly Ala Gly Asp Gln Gly Leu
110 115 120

atg ttc ggt tac gcc acc aat gag acc gac gtc ctc atg ccg gca gcg 2422
Met Phe Gly Tyr Ala Thr Asn Glu Thr Asp Val Leu Met Pro Ala Ala
125 130 135

atc cac tac tcg cac ctg ctg gtt aag cgc caa tcc gaa gtc cgt aac 2470
Ile His Tyr Ser His Leu Leu Val Lys Arg Gln Ser Glu Val Arg Asn
140 145 150 155

tcc aag aag ctg ccg tgg ttg cgc ccg gac gct aag agc cag gtc acc 2518
Ser Lys Lys Leu Pro Trp Leu Arg Pro Asp Ala Lys Ser Gln Val Thr
160 165 170

ttc aag tac gaa ggc gat aag atc gtc ggt tgc gat gcg gtg gtg ctc 2566
Phe Lys Tyr Glu Gly Asp Lys Ile Val Gly Cys Asp Ala Val Val Leu
175 180 185



tcc acg cag cat gac gag acc gtt gat cag aag acc gtc cac gag ggc 2614
Ser Thr Gln His Asp Glu Thr Val Asp Gln Lys Thr Val His Glu Gly
190 195 200

gtg atg gaa gag atc atc aag ccc atc ctc ggt gac acc ggc tgg ctg 2662
Val Met Glu Glu Ile Ile Lys Pro Ile Leu Gly Asp Thr Gly Trp Leu
205 210 215

acc aac gag acc aag tac cac att aac ccg acc ggc cgc ttc gtt act 2710
Thr Asn Glu Thr Lys Tyr His Ile Asn Pro Thr Gly Arg Phe Val Thr
220 225 230 235

ggt ggt ccg ttg ggt gac tgt ggc ttg acc ggc cgt aag atc atc gtc 2758
Gly Gly Pro Leu Gly Asp Cys Gly Leu Thr Gly Arg Lys Ile Ile Val
240 245 250

gat acc tac ggc ggt atg ggc cgt cac ggc ggc ggt gcc ttc tcc ggc 2806
Asp Thr Tyr Gly Gly Met Gly Arg His Gly Gly Gly Ala Phe Ser Gly
255 260 265

aaa gat ccg tct aag gtc gat cgc tcg gcg gct tat gtt ggc cgt tat 2854
Lys Asp Pro Ser Lys Val Asp Arg Ser Ala Ala Tyr Val Gly Arg Tyr
270 275 280

gta gct aag aac ata gtg gct gcc ggt ctg gcc gac cgc tgt gag gtg 2902
Val Ala Lys Asn Ile Val Ala Ala Gly Leu Ala Asp Arg Cys Glu Val
285 290 295

cag ctc tcc tac gcc atc ggt gtt gcc gag ccg acc tcg gtc aat gtc 2950
Gln Leu Ser Tyr Ala Ile Gly Val Ala Glu Pro Thr Ser Val Asn Val
300 305 310 315

gaa acc ttt ggt acc ggc aag gtc gaa gaa gag ctc gcg cac gat ggc 2998
Glu Thr Phe Gly Thr Gly Lys Val Glu Glu Glu Leu Ala His Asp Gly
320 325 330

cag ccc taagccgtgg ccgctgcgcg agccctggta gccggcgcga tcgaacgccg 3054
Gln Pro

caaagatatg tgcctgctgc tctgggggga tccccggacc ggtatcgcgc acttggaagg 3114
tgattcggtc atcgagctcc accgcggtgg cggccgctct agaactagtg gatcccccgg 3174
gctgcaggaa ttcgatatca agcttatcga taccgtcgac ctcga 3219
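
The codon-by-codon translation printed beneath the nucleotide sequence can be re-checked mechanically. The following is a minimal Python sketch, assuming that the standard genetic code applies to this sequence; the function name `translate` and the table layout are illustrative only and do not form part of the application.

```python
# Standard genetic code, with codons ordered t, c, a, g at each position.
# Assumption: the standard code applies to the sequence listed above.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a nucleotide string in frame 1; spaces are ignored, '*' is a stop."""
    seq = "".join(dna.lower().split())
    usable = len(seq) - len(seq) % 3  # drop an incomplete trailing codon
    return "".join(CODON_TABLE[seq[p:p + 3]] for p in range(0, usable, 3))


# First codons of the second open reading frame in the listing.
print(translate("atg gcg acg cgc tac gac gat"))  # MATRYDD
```

A mismatch between the output and the printed translation points to a transcription error in either the nucleotide line or the amino acid line.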



<210> 6
<211> 268
<212> PRT
<213> Ectothiorhodospira halochloris

<400> 6

Met Asn Thr Thr Thr Glu Gln Asp Phe Gly Ala Asp Pro Thr Lys Val
1 5 10 15

Arg Asp Thr Asp His Tyr Thr Glu Glu Tyr Val Asp Gly Phe Val Asp
20 25 30

Lys Trp Asp Asp Leu Ile Asp Trp Asp Ser Arg Ala Lys Ser Glu Gly
35 40 45

Asp Phe Phe Ile Gln Glu Leu Lys Lys Arg Gly Ala Thr Arg Ile Leu
50 55 60

Asp Ala Ala Thr Gly Thr Gly Phe His Ser Val Arg Leu Leu Glu Ala
65 70 75 80

Gly Phe Asp Val Val Ser Ala Asp Gly Ser Ala Glu Met Leu Ala Lys
85 90 95

Ala Phe Glu Asn Gly Arg Lys Arg Gly His Ile Leu Arg Thr Val Gln
100 105 110

Val Asp Trp Arg Trp Leu Asn Arg Asp Ile His Gly Arg Tyr Asp Ala
115 120 125

Ile Ile Cys Leu Gly Asn Ser Phe Thr His Leu Phe Asn Glu Lys Asp
130 135 140

Arg Arg Lys Thr Leu Ala Glu Phe Tyr Ser Ala Leu Asn Pro Glu Gly
145 150 155 160

Val Leu Ile Leu Asp Gln Arg Asn Tyr Asp Gly Ile Leu Asp His Gly
165 170 175

Tyr Asp Ser Ser His Ser Tyr Tyr Tyr Cys Gly Glu Gly Val Ser Val
180 185 190

Tyr Pro Glu His Val Asp Asp Gly Leu Ala Arg Phe Lys Tyr Glu Phe
195 200 205

Asn Asp Gly Ser Thr Tyr Phe Leu Asn Met Phe Pro Leu Arg Lys Asp
210 215 220

Tyr Thr Arg Arg Leu Met His Glu Val Gly Phe Gln Lys Ile Asp Thr
225 230 235 240

Tyr Gly Asp Phe Lys Ala Thr Tyr Arg Asp Ala Asp Pro Asp Phe Phe
245 250 255

Ile His Val Ala Glu Lys Glu Tyr Arg Glu Glu Asp
260 265



<210> 7
<211> 279
<212> PRT
<213> Ectothiorhodospira halochloris

<400> 7

Met Ala Thr Arg Tyr Asp Asp Gln Ala Ile Glu Thr Ala Arg Gln Tyr
1 5 10 15

Tyr Asn Ser Glu Asp Ala Asp Asn Phe Tyr Ala Ile Ile Trp Gly Gly
20 25 30

Glu Asp Ile His Ile Gly Leu Tyr Asn Asp Asp Glu Glu Pro Ile Ala
35 40 45

Asp Ala Ser Arg Arg Thr Val Glu Arg Met Ser Ser Leu Ser Arg Gln
50 55 60

Leu Gly Pro Asp Ser Tyr Val Leu Asp Met Gly Ala Gly Tyr Gly Gly
65 70 75 80

Ser Ala Arg Tyr Leu Ala His Lys Tyr Gly Cys Lys Val Ala Ala Leu
85 90 95

Asn Leu Ser Glu Arg Glu Asn Glu Arg Asp Arg Gln Met Asn Lys Glu
100 105 110

Gln Gly Val Asp His Leu Ile Glu Val Val Asp Ala Ala Phe Glu Asp
115 120 125

Val Pro Tyr Asp Asp Gly Val Phe Asp Leu Val Trp Ser Gln Asp Ser
130 135 140

Phe Leu His Ser Pro Asp Arg Glu Arg Val Leu Arg Glu Ala Ser Arg
145 150 155 160

Val Leu Arg Ser Gly Gly Glu Phe Ile Phe Thr Asp Pro Met Gln Ala
165 170 175

Asp Asp Cys Pro Glu Gly Val Ile Gln Pro Ile Leu Asp Arg Ile His
180 185 190

Leu Glu Thr Met Gly Thr Pro Asn Phe Tyr Arg Gln Thr Leu Arg Asp
195 200 205

Leu Gly Phe Glu Glu Ile Thr Phe Glu Asp His Thr His Gln Leu Pro
210 215 220

Arg His Tyr Gly Arg Val Arg Arg Glu Leu Asp Arg Arg Glu Gly Glu
225 230 235 240

Leu Gln Gly His Val Ser Ala Glu Tyr Ile Glu Arg Met Lys Asn Gly
245 250 255

Leu Asp His Trp Val Asn Gly Gly Asn Lys Gly Tyr Leu Thr Trp Gly
260 265 270

Ile Phe Tyr Phe Arg Lys Gly
275



<210> 8
<211> 333
<212> PRT
<213> Ectothiorhodospira halochloris

<400> 8

Met Thr Lys Arg Tyr Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His
1 5 10 15

Pro Asp Lys Met Ala Asp Gln Ile Ser Asp Ala Leu Leu Asp Glu Phe
20 25 30

Leu Arg Gln Asp Pro Lys Ser Arg Val Ala Ala Glu Thr Met Ile Gln
35 40 45

Thr Gly Met Val Val Val Ala Gly Glu Ile Lys Ser Asn Ala Lys Ile
50 55 60

Asn Val Glu Pro Leu Val Arg Glu Val Val Arg Asp Ile Gly Tyr Thr
65 70 75 80

Ser Ser Asp Met Gly Phe Asp Ala Asp Thr Cys Ala Val Leu Asn Ala
85 90 95

Leu Gly Glu Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Glu Glu
100 105 110

Glu Glu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala
115 120 125

Thr Asn Glu Thr Asp Val Leu Met Pro Ala Ala Ile His Tyr Ser His
130 135 140

Leu Leu Val Lys Arg Gln Ser Glu Val Arg Asn Ser Lys Lys Leu Pro
145 150 155 160

Trp Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Lys Tyr Glu Gly
165 170 175

Asp Lys Ile Val Gly Cys Asp Ala Val Val Leu Ser Thr Gln His Asp
180 185 190

Glu Thr Val Asp Gln Lys Thr Val His Glu Gly Val Met Glu Glu Ile
195 200 205

Ile Lys Pro Ile Leu Gly Asp Thr Gly Trp Leu Thr Asn Glu Thr Lys
210 215 220

Tyr His Ile Asn Pro Thr Gly Arg Phe Val Thr Gly Gly Pro Leu Gly
225 230 235 240

Asp Cys Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly
245 250 255

Met Gly Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys
260 265 270

Val Asp Arg Ser Ala Ala Tyr Val Gly Arg Tyr Val Ala Lys Asn Ile
275 280 285

Val Ala Ala Gly Leu Ala Asp Arg Cys Glu Val Gln Leu Ser Tyr Ala
290 295 300

Ile Gly Val Ala Glu Pro Thr Ser Val Asn Val Glu Thr Phe Gly Thr
305 310 315 320

Gly Lys Val Glu Glu Glu Leu Ala His Asp Gly Gln Pro
325 330



<210> 9
<211> 37
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 9

Glu Lys Ser Tyr Arg Thr Glu Asp Glu Phe Val Asp Met Tyr Ser Asn
1 5 10 15

Ala Val His Thr Ala Arg Asp Tyr Tyr Asn Ser Glu Asp Ala Ser Asn
20 25 30

Phe Tyr Tyr His Val
35



<210> 10
<211> 15
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 10

Gly Ser Val Leu Phe Thr Asp Pro Met Ala Ser Asp Asp Ala Lys
1 5 10 15
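
The length stated in a <211> field can be checked against the listed residues by converting the three-letter codes to one-letter codes and counting them. The following is a minimal Python sketch using the peptide of SEQ ID NO: 10; the names `THREE_TO_ONE` and `to_one_letter` are hypothetical helpers introduced here for illustration.

```python
# Three-letter to one-letter amino acid codes; Xaa denotes an unspecified residue.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


# Peptide of SEQ ID NO: 10, for which <211> states a length of 15.
peptide = "Gly Ser Val Leu Phe Thr Asp Pro Met Ala Ser Asp Asp Ala Lys"
one_letter = to_one_letter(peptide)
print(one_letter, len(one_letter))  # GSVLFTDPMASDDAK 15
```

An unrecognised token raises a `KeyError`, which is a convenient way of locating a misspelled residue code in a transcribed listing.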



<210> 11
<211> 10
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 11

Thr Gly Leu Arg Asn Tyr Gln Ala Gly Asn
1 5 10

<210> 12
<211> 16
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 12

Leu Xaa Glu Leu Gly Pro Ile Leu Asp Arg Leu His Leu Asp Ser Gly
1 5 10 15



<210> 13
<211> 25
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 13

Glu Leu Thr Arg Leu Gly Leu Gln Asn Ile Glu Phe Glu Asp Leu Ser
1 5 10 15

Glu Tyr Leu Pro Val His Tyr Gly Arg
20 25



<210> 14
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 14

Val Asp Ile Ser Pro Glu Thr Arg Ile Leu Asp Leu Gly Ser Gly Tyr
1 5 10 15

Gly Ala



<210> 15
<211> 23
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<220>
<221> SITE
<222> (4)
<223> Xaa = Thr or Glu

<400> 15

Asn Thr Thr Xaa Glu Gln Asp Phe Gly Ala Asp Pro Thr Lys Val Arg
1 5 10 15

Asp Thr Asp Ala Tyr Thr Glu
20



<210> 16
<211> 13
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 16

Val Arg Asp Thr Asp His Tyr Thr Glu Glu Tyr Val Asp
1 5 10



<210> 17
<211> 14
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 17

Asp Tyr Thr Arg Arg Leu Met His Glu Val Gly Phe Gln Lys
1 5 10



<210> 18
<211> 17
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 18

Ala Thr Tyr Arg Asp Ala Asp Pro Asp Phe Phe Leu His Val Ala Glu
1 5 10 15

Lys



<210> 19
<211> 24
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide

<400> 19

Val Arg Asp Thr Asp His Tyr Thr Glu Glu Tyr Val Asp Gly Phe Val
1 5 10 15

Asp Lys Trp Asp Asp Leu Ile Asp
20



<210> 20
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<220>
<221> modified_base
<222> (15)
<223> i

<400> 20

gargaygart tygtngayat gt 22



<210> 21
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<400> 21

ytgrttdaty tcraaytcrt c 21



<210> 22
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<220>
<221> modified_base
<222> (15)
<223> i

<400> 22

garcargayt tyggngcnga ycc

<210> 23
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic

<220>
<221> modified_base
<222> (12)
<223> i

<400> 23

arraaraart cnggrtcngc rtc
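
The degenerate primers of SEQ ID NO: 20 to 23 use IUPAC ambiguity codes (r, y, d, n), so each entry stands for a pool of oligonucleotides. The size of each pool can be estimated with the following minimal Python sketch. It assumes that only the positions given in the <222> fields are inosine and that an inosine counts as a single base; any other `n` is counted as a four-fold choice. The name `degeneracy` is a hypothetical helper introduced here for illustration.

```python
# Number of bases represented by each IUPAC nucleotide code.
IUPAC_CHOICES = {
    "a": 1, "c": 1, "g": 1, "t": 1,
    "r": 2, "y": 2, "s": 2, "w": 2, "k": 2, "m": 2,
    "b": 3, "d": 3, "h": 3, "v": 3,
    "n": 4,
}


def degeneracy(primer: str, inosine_positions=()) -> int:
    """Count the distinct oligonucleotides represented by a degenerate primer.

    Positions are 1-based, as in the <222> fields. A position annotated as
    inosine is treated as one base (assumption), not as a four-fold choice.
    """
    total = 1
    bases = "".join(primer.lower().split())
    for position, base in enumerate(bases, start=1):
        if position in inosine_positions:
            continue
        total *= IUPAC_CHOICES[base]
    return total


# Sequences and inosine positions as given for SEQ ID NO: 20 to 23.
primers = {
    20: ("gargaygart tygtngayat gt", (15,)),
    21: ("ytgrttdaty tcraaytcrt c", ()),
    22: ("garcargayt tyggngcnga ycc", (15,)),
    23: ("arraaraart cnggrtcngc rtc", (12,)),
}
for seq_id, (sequence, inosines) in primers.items():
    print(seq_id, degeneracy(sequence, inosines))
# Prints one pair per line: 20 32, 21 192, 22 128, 23 256
```

If the unannotated `n` at position 18 of SEQ ID NO: 22 and 23 were also inosine, the corresponding pool sizes would be four times smaller.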

<210> 24
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic

<400> 24

cggaccatgg atacgactac tgagcag 27



<210> 25 
<211> 34
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<400> 25 

gctcagatct gtcctcctcc cgatattcct tctc 34 



<210> 26
<211> 31
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<400> 26

gcatgccatg gcgacgcgct acgacgatca a 31



<210> 27
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic

<400> 27

gggaagatct ccctttgcgg aagtaaaaga tacc

<210> 28
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<400> 28

gctgccatgg agaagagcta ccgcaccgag

<210> 29
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<400> 29

gggaagatct tgccctggcg tggatgatgc ccca

<210> 30
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic DNA

<400> 30

catgccatgg ccaagagcgt ggacgatctt 30
